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Novel method for identifying antibacterial compounds 

The present invention relates to a method for identifying an antagonist or inhibitor 
of the expression of a gene encoding a polypeptide essential for bacterial growth 
or survival as well as for an antagonist or inhibitor of said polypeptide. The 
invention further relates to a method for improved antagonists or inhibitors. The 
invention also provides an antagonist or inhibitor of the activity of said 
polypeptide. The invention is further related to a method for producing a 
composition comprising said antagonist or inhibitor. Furthermore, the invention is 
related to the use of the polypeptide and the antagonist or inhibitor as well as to a 
method to identify a surrogate marker. 

Several documents are cited throughout the text of this specification. Each of the 
documents cited herein (including any manufacturer's specifications, instructions, 
etc.) are hereby incorporated by reference; however, there is no admission that 
any document cited is indeed prior art of the present invention. 

Since the beginning of the 1980s, a new trend has been observed in the 
industrialized countries. On the one hand, resistances to antibiotics have 
increased, which make it difficult or even impossible to treat many of the disease- 
causing agents. On the other hand, new infectious diseases, which had been 
unknown up to now, arise, and old diseases return. For example, diphteria and 
tuberculosis are old epidemics and increasingly surmounting in many different 
parts of the world. Especially tuberculosis (TB), a chronic infectious disease that is 
generally caused by infection with Mycobacterium tuberculosis, is a disease of 
major concern. Each year, 8 to 10 million new cases of TB are described, and, 
causing more than three million deaths per year, TB is a major disease in 
developing countries as well as an increasing problem in developed areas of the 
world due to, for example, antibiotic resistance. 
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Additionally, M. bovis BCG vaccination has failed to protect against TB in several 
trials (WHO, Tech. Rep. Ser. (1980), 651, 1-15) for reasons that are not entirely 
clear (Fine, Tubercle 65 (1984), 137-153). It has been shown that the vaccine 
strain of M. bovis BCG only confers protection against the severe form of miliary 
tuberculosis in children (Fine, Lancet 346 (1995), 1339-1345). In contrast, its 
protective capacity against the most common form, pulmonary tuberculosis in 
adults, is low and highly variable (Colditz (1994), JAMA 271, 698). 

The causes for this new trend are complex: mainly, the increasing number of 
antibiotic applications in medicine and agriculture often combined with an 
improper und uncontrolled use, helps to establish resistant organisms and 
generate the threat of bacterial infections resistant to all available therapies. 
Conventional techniques of developing antibiotics, i.e. synthesis of candidate 
substances and screening for antibacterial substances, even though speeded up 
by several orders of magnitude by the use of combinatorial approaches in recent 
years (e.g. US5324483, US5545568), are still too inefficient as they involve 
multiple screening steps of hundreds or thousands of more or less randomly 
chosen substances for efficiency in combating various infectious agents. 
Therefore, it is a major concern to fight the growing number of bacterial infections 
due to an increased frequency of multiple antibiotic resistances and to improve 
the available antibacterial therapies. 

Thus, the technical problem underlying the present invention was to provide a 
method and means for the development of an additional effective antibacterial 
therapy of infected humans and animals that can be used for the treatment of a 
broad spectrum of bacterial infections or diseases or disorders related to bacterial 
infections. The solution to this technical problem is achieved by providing the 
embodiments characterized in the claims. 

Accordingly, the present invention relates to a method for identifying an antagonist 
or inhibitor of the expression of a gene encoding a polypeptide essential for 
bacterial growth wherein said gene is selected from the group consisting of ygbB, 
yfhC, yacE, ychB, yejD, yrfl, yggj, yjeE, yiaO, yrdC, yhbC, ygbP, ybeY, gcpE, 
kdtB, pfs, ycaJ, b1808, yeaA, yagF, b1983, yidD. yceG and/or yjbC the sequence 
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of said genes being shown in Fig. 1, or a fragment or derivative or ortholog 
thereof, said method comprising the steps of 

(a) testing a candidate antagonist or inhibitor or a sample comprising a plurality 
of said candidate antagonists or inhibitors for the inhibition or reduction of 
transcription of said gene or a fragment or derivative thereof; or 

(b) testing a candidate antagonist or inhibitor or a sample comprising a plurality 
of said candidate antagonists or inhibitors for the inhibition or reduction of 
translation of mRNA transcribed from said gene or a fragment or derivative 
thereof; and 

(c) identifying an antagonist or inhibitor or a sample comprising a plurality of 
said candidate antagonists or inhibitors that tests positive in step (a) and/or 
(b). 

The term "antagonist" or "inhibitor" as used herein means naturally occurring and 
synthetic compounds capable of counteracting or inhibiting an activity of a gene or 
gene product or interactions of the gene or gene product with other genes or gene 
products. Determining whether a compound is capable of inhibiting or 
counteracting specific gene expression can be done, for example, by Northern 
blot analysis, Western blot analysis or proteome analysis. It can further be done 
by monitoring the phenotypic characteristics of a bacterial cell contacted with the 
compounds and compare it to that of a wild-type cell. In an additional 
embodiment, said characteristics may be compared to that of a cell contacted with 
a compound which is either known to be capable or incapable of suppressing or 
activating the protein or gene, respectively, according to the invention. For 
example, the bacterial cell can be a transgenic cell and the phenotypic 
characteristics comprises a readout system. Further examples of determining 
whether a compound is capable of inhibiting or counteracting specific gene 
expression are described below. 

The term "expression" means the production of a protein or nucleotide sequence 
in a cell. However, said term also includes expression of the protein in a cell-free 
system. It includes transcription into an RNA product, and/or translation into a 
polypeptide from a DNA encoding that product. 

The term "transcription" as used herein means a DNA template dependent 
synthesis of a ribonucleic acid polymer encoding a polypeptide or a regulatory 
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sequence. The term "translation" as used herein means the polymerization of a 
polypeptide that is encoded by an RNA molecule by a protein complex. 
As used in accordance with the present invention, the term "fragment or 
derivative" denotes any variant the amino acid or nucleotide sequence of which 
deviates in its primary structure, e.g., in sequence composition or in length as well 
as to analogue components. For example, one or more amino acids of a 
polypeptide may be replaced in said fragment or derivative as long as the 
modified polypeptides remain functionally equivalent to their described 
counterparts. The term "fragment or derivative" further denotes compounds 
analog to an antagonist or inhibitor that should have a stabilized electronic 
configuration and molecular conformation that allows key functional groups to be 
presented to the mentioned polypeptide in substantially the same way as the 
antagonist and inhibitor. The variant of the polypeptide may be a naturally 
occurring allelic variant of the polypeptide or non-naturally occurring variants of 
those polynucleotides. 

The term "orthologs" as used herein means homologous sequences in different 
species that evolved from a common ancestoral gene by speciation. Normally, 
orthologs retain the same function in the course of evolution. However, 
orthologous genes may or may not be responsible for a similar function (see, e.g., 
the glossary of the "Trends Guide to Bioinformatics", Trends Supplement 1998, 
Elsevier Science). Orthologous genes, nucleic acids or proteins comprise genes, 
nucleic acids or proteins which have one or more sequences or structural motifs in 
common. For example, the sequence motifs of proteins can comprise short, i.e. 
repetitive sequences or amino acid positions conserved in the primary structure 
and/or conserved in higher protein structures, e.g. secondary or tertiary structure. 
Orthologous nucleic acids or genes can comprise molecules having short 
stretches of one or more homologous (same or similar) sequences, for example 
protein binding boxes or structure forming boxes. Methods for the identification of 
a candidate ortholog of a gene or polypeptide described herein are known to 
those skilled in the art and are described for example in Sambrook et al. (1989), 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 
New York, or Ausubel (1994), Current Protocols in Mol. Biol.. The person skilled in 
the art knows how to identify orthologous genes, nucleic acids or polypeptides by 
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computer supported analysis (e.g. BLAST) of known sequences and its 
interpretation. 

The terms "gene", "polynucleotide", "nucleic acid sequence", "nucleotide 
sequence", "DNA sequence" or "nucleic acid molecule" as used herein refer to 
polymeric forms of nucleotides of any length, either ribonucleotides or 
deoxyribonucleotides and only to the primary structure of the molecule. Thus, 
these terms include double- and single-stranded DNA, and RNA. They also 
include known types of modifications, for example, methylation, "caps" 
substitution of one or more of the naturally occurring nucleotides with an analog. 
Preferably, the DNA sequence of the invention comprises a coding sequence 
encoding at least the mature form of the above defined protein, i.e. the protein 
which is posttranslationally processed in its biologically active form, for example 
due to cleavage of leader or secretory sequences or a proprotein sequence or 
other natural proteolytic cleavage points. 

The term "plurality of candidate antagonists or inhibitors" is to be understood as a 
plurality of substances which may or may not be identical. 

Said antagonists or inhibitors or plurality of candidate antagonists or inhibitors 
may be chemically synthesized or microbiologically produced and/or comprised in, 
for example, samples, e.g., cell extracts from, e.g., plants, animals or 
microorganisms. Furthermore, said compound(s) may be known in the art but 
hitherto not known to be capable of suppressing or inhibiting said polypeptide. 
The reaction mixture may be a cell free extract or may comprise a cell or tissue 
culture. Suitable set ups for the method of the invention are known to the person 
skilled in the art and are, for example, generally described in Alberts et al., 
Molecular Biology of the Cell, third edition (1994), in particular Chapter 17. The 
plurality of compounds may be, e.g., added to the reaction mixture, culture 
medium, injected into the cell or sprayed onto the plant. 

By combining computational processing of genomic information with microbial 
genetics, the inventors have been able to identify 24 E. coli essential genes and 
their respective orthologs (Fig. 3) that fulfill several criteria for being attractive 
antibacterial targets: hypothetical open reading frames, coding for essential 
functions (mutation is lethal for growth in rich media), broad conservation 
(orthologs are present in a wide range of bacteria including H, influenza. S. 
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pneumoniae. H. pylori, and B. burgdorferi) (Fig. 3) and low toxicity potential in 
higher organisms (mostly no orthologs are identified in the simple eukaryote S. 
cerevisiae). Thus, an antagonist or inhibitor of the expression of such an essential 
gene or of its function provides the key for an antibacterial therapy. The inventors 
assume that said antagonist or inhibitor stops or reduces bacterial growth and/or 
mediates bacterial death. 

Thus, the method of the present invention provides the options of development of 
new broad spectrum antibiotics against new pharmaceutical important targets. 
The findings of the present invention are particularly important in view of the 
drawbacks of the present forms of treatment of bacterial infections, diseases and 
disorders related to bacterial infections. 

In line with the above, the present invention also relates to a method for testing a 
candidate antagonist or inhibitor of a polypeptide or mRNA essential for bacterial 
growth or survival encoded by a gene selected from the group consisting of ygbB, 
yfhC, yacE, ychB, yejD, yrfl, yggj, yjeE, yiaO, yrdC, yhbC, ygbP, ybeY, gcpE, 
kdtB, pfs, ycaJ, b1808 ( yeaA t yagF, b1983, yidD, yceG and/or yjbC or a fragment, 
derivative or ortholog thereof comprising the steps of 

(a) contacting a bacterial cell with candidate antagonist or inhibitor or a sample 
comprising a plurality of said candidate antagonists or inhibitors; and 

(b) testing whether said contacting leads to cell growth inhibition and/or cell 
death. 

In a further embodiment, the present invention relates to a method for testing a 
candidate antagonist or inhibitor of the function of a gene essential for bacterial 
growth or survival wherein said gene is selected from the group consisting of 
ygbB, yfhC, yacE, ychB, yejD, yrfl, yggj, yjeE, yiaO, yrdC, yhbC, ygbP, ybeY, 
gcpE, kdtB, pfs, yea J, b1808, yeaA, yagF, b1983, yidD, yceG and/or yjbC or a 
fragment, derivative or ortholog thereof, comprising the steps of 
(a) contacting a bacterial cell comprising said gene with a candidate antagonist 

or inhibitor or a sample comprising a plurality of said candidate antagonists 

or inhibitors; and 
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(b) testing whether said contacting leads to cell growth inhibition and/or cell 
death. 

Bacteria, for which was shown that a gene as mentioned above expressed is 
essential, can be used in a proliferation assay to identify both ligands and 
potential antagonists or inhibitors to said polypeptide encoded by said essential 
gene. For example, E. coli are grown in culture medium and incorporation of DNA 
precursors such as 3 H-thymidine or 5-bromo-2'-deoxyuridine (BrdU) is monitored 
as a parameter for DNA synthesis and cellular proliferation. Cells which have 
incorporated BrdU into DNA can be detected using a monoclonal antibody against 
BrdU and measured by an enzyme or fluorochrome-conjugated second antibody. 
The reaction is quantitated by fluorimetry or by spectrophotometry. The ability of 
the compound to be screened to inhibit proliferation may then be quantified. 
Further methods to determine growth and proliferation of bacteria are well known 
in the art, for example in Drews, Mikrobiol. Praktikum, Berlin, 1976. 
Preferably, the antagonist or inhibitor binds to the gene product, i.e. the RNA or 
polypeptide, specifically encoded by said gene. 

For example, a candidate antagonist or inhibitor not known to be capable of 
binding to an polypeptide encoded by a essential gene as described above can be 
tested to bind thereto comprising contacting a bacterial cell comprising an isolated 
molecule encoding said polypeptide with a candidate antagonist or inhibitor under 
conditions permitting binding of ligands known to bind thereto, detecting the 
presence of any bound ligand, and thereby determining whether such candidate 
antagonist or inhibitor inhibits the binding of a ligand to a polypeptide as described 
above. 

Proteins that bind to a polypeptide as described above and might inhibit or 
counteract to said polypeptide can be "captured" using the yeast two-hybrid 
system (Fields, Nature 340 (1989), 245-246). A modified version of the yeast two- 
hybrid system has been described by Roger Brent and his colleagues (Gyuris, 
Cell 75 (1993), 791-803; Zervos, Cell 72 (1993), 223-232). Briefly, a domain of the 
polypeptide is used as bait for binding compounds. Positives are then selected by 
their ability to grow on plates lacking leucine, and then further tested for their 
ability to turn blue on plates with X-gal, as previously described in great detail 
(Gyuris, supra; WO 95/31544). Once amino acid sequences are identified which 
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bind to a polypeptide essential for bacterial growth or survival these sequences 
can be screened for antagonist activity using, for example, the proliferation assay 
described above or used for screening for antagonists of said binding. 
Another assay which can be performed to identify inhibitors and antagonists 
involves the use of combinatorial chemistry to produce random peptides which 
then can be screened for both binding affinity and antagonist effects. One such 
assay has recently been performed using random peptides expressed on the 
surface of a bacteriophage (Wu (1996), Nature Biotechnology 14, 429-431). 

In a preferred embodiment of the method of the present invention said method 
further comprises identifying an antagonist or inhibitor optionally from said sample 
of candidate antagonists or inhibitors. 

If a sample contains a candidate antagonist or inhibitor, or a plurality of candidate 
antagonists or inhibitors, as identified in the method of the invention, then it is 
either possible to isolate the candidate antagonists or inhibitors from the original 
sample identified as containing the compound capable of suppressing or inhibiting 
bacterial growth or survival, or one can further subdivide the original sample, for 
example, if it consists of a plurality of different candidate antagonists or inhibitors, 
so as to reduce the number of different substances per sample and repeat the 
method with the subdivisions of the original sample. Depending on the complexity 
of the samples, the steps described above can be performed several times, 
preferably until the sample identified according to the method of the invention only 
comprises a limited number of or only one substance(s). Preferably said sample 
comprises substances of similar chemical and/or physical properties, and most 
preferably said substances are identical. As regards the identification of candidate 
antagonists or inhibitors by any of the above-referenced embodiments of the 
invention, a variety of formats or tools is available to the person skilled in the art. 
Thus, several methods are known to the person skilled in the art for producing 
and screening large libraries to identify compounds having specific affinity for a 
target. These methods include the phage-display method in which randomized 
peptides are displayed from phage and screened by affinity chromatography to an 
immobilized receptor see, e.g., WO 91/17271, WO 92/01047, US-A-5,223.409. In 
another approach, combinatorial libraries of polymers immobilized on a chip are 
synthesized using photolithography; see, e.g., US-A-5. 143,854, WO 90/15070 
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and WO 92/10092. The immobilized polymers are contacted with a labeled 
receptor and scanned for label to identify polymers binding to the receptor. The 
synthesis and screening of peptide libraries on continuous cellulose membrane 
supports that can be used for identifying binding ligands of the polypeptide of the 
invention and thus possible inhibitors and antagonists is described, for example, 
in Kramer. Methods Mol. Biol. 87 (1998), 25-39. This method can also be used, 
for example, for determining the binding sites and the recognition motifs in the 
polypeptide as described above. In like manner, the substrate specificity of the 
DnaK chaperon was determined and the contact sites between human interleukin- 
6 and its receptor; see Rudiger, EMBO J. 16 (1997), 1501-1507 and Weiergraber, 
FEBS Lett. 379 (1996), 122-126, respectively. Furthermore, the above-mentioned 
methods can be used for the construction of binding supertopes derived from the 
polypeptide of the invention. A similar approach was successfully described for 
peptide antigens of the anti-p24 (HIV-1) monoclonal antibody; see Kramer, Cell 91 
(1997), 799-809. A general route to fingerprint analyses of peptide-antibody 
interactions using the clustered amino acid peptide library was described in 
Kramer, Mol. Immunol. 32 (1995), 459-465. In addition, antagonists or inhibitors of 
a polypeptide described above can be derived and identified from monoclonal 
antibodies that specifically react with said polypeptide in accordance with the 
methods as described in Doring, Mol. Immunol. 31 (1994), 1059-1067. 
More recently, WO 98/25146 described further methods for screening libraries of 
complexes for compounds having a desired property, especially, the capacity to 
agonize, bind to, or antagonize a polypeptide or its cellular receptor. The 
complexes in such libraries comprise a compound under test, a tag recording at 
least one step in synthesis of the compound, and a tether susceptible to 
modification by a reporter molecule. Modification of the tether is used to signify 
that a complex contains a compound having a desired property. The tag can be 
decoded to reveal at least one step in the synthesis of such a compound. Other 
methods for identifying compounds which interact with the proteins according to the 
invention or nucleic acid molecules encoding such molecules are, for example, the 
in vitro screening with the phage display system as well as filter binding assays or 
"real time" measuring of interaction using, for example, the BIAcore apparatus 
(Pharmacia). 
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All these methods can be used in accordance with the present invention to identify 
antagonists and inhibitors of the polypeptide of the invention. 

Additionally, the present invention relates in a preferred embodiment to a method 
comprising improving inhibitors or antagonists identified by peptidomimetics or by 
applying phage display or combinatorial library technique step(s). 
Peptidomimentics, phage display and combinatorial library techniques are well- 
known in the art and can be applied by the person skilled in the art without further 
ado to the improvement of the antagonist or inhibitor that is identified by the basic 
method referred to herein above. 

Methods for the generation and use of peptidomimetic combinatorial libraries are 
described in the prior art, for example in Ostresh, Methods In Enzymology 267 
(1996), 220-236; Dosner, Bioorg. Med. Chem. 4 (1996), 709-715; Beeley, Trends 
Biotechn. 12 (1994), 213-216; al-Obeidi, Mol. Biotechn. 9 (1998), 205-223; Wiley, 
Med. Res. Rev. 13 (1993), 327-384; Bohm, J. Comput. Aided Mol. Des. 10 
(1996), 265-272; and Hruby, Biopolymers 43 (1997), 219-266. 
Various sources for the basic structure of such an antagonist or inhibitor can be 
employed and comprise, for example, mimetic analogs of the polypeptide of the 
invention. Mimetic analogs of the polypeptide of the invention or biologically active 
fragments thereof can be generated by, for example, substituting the amino acids 
that are expected to be essential for the biological activity with, e.g., 
stereoisomers, i.e. D-amino acids; see e.g., Tsukida, J. Med. Chem. 40 (1997), 
3534-3541. Furthermore, in case fragments are used for the design of biologically 
active analogs pro-mimetic components can be incorporated into a peptide to 
reestablish at least some of the conformational properties that may have been lost 
upon removal of part of the original polypeptide; see, e.g., Nachman, Regul. Pept. 
57 (1995), 359-370. Furthermore, the polypeptide can be used to identify 
synthetic chemical peptide mimetics that bind to or can function as a ligand. 
substrate, binding partner or the receptor of the polypeptide as effectively as does 
the natural polypeptide; see, e.g., Engleman, J. Clin. Invest. 99 (1997), 2284- 
2292. 

The structure-based design and synthesis of low-molecular-weight synthetic 
molecules that mimic the activity of the native biological polypeptide is further 
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described in, e.g., Dowd, Nature Biotechnol. 16 (1998), 190-195; Kieber-Emrnons, 
Current Opinion Biotechnol. 8 (1997), 435-441; Moore, Proc. West Pharmacol. 
Soc. 40 (1997), 115-119; Mathews, Proc. West Pharmacol. Soc. 40 (1997), 121- 
125; Mukhija, European J. Biochem. 254 (1998), 433-438. 

It is also well known to the person skilled in the art, that it is possible to design, 
synthesize and evaluate mimetics of small organic compounds that, for example, 
can act as a substrate or ligand to a polypeptide as encoded by the essential 
gene as identified above. For example, it has been described that D-glucose 
mimetics of hapalosin exhibited similar efficiency as hapalosin in antagonizing 
multidrug resistance assistance-associated protein in cytotoxicity; see Dinh, J. 
Med. Chem. 41 (1998), 981-987. 

The essential gene described above or the RNA encoded thereof, as has been 
described above, can also serve as a target for antagonists or inhibitors. 
Antagonists may comprise, for example, proteins that bind to the mRNA of said 
gene, thereby destabilizing the native conformation of the mRNA and disturbing 
transcription and/or translation. Furthermore, methods are described in the 
literature for identifying nucleic acid molecules such as an RNA fragment that 
mimics the structure of a defined or undefined target RNA molecule to which a 
compound binds inside of a cell resulting in retardation of cell growth or cell death; 
see, e.g., WO 98/18947 and references cited therein. These nucleic acid 
molecules can be used for identifying unknown compounds of pharmaceutical 
and/or agricultural interest, and for identifying unknown RNA targets for use in 
treating a disease. These methods and compositions can be used in screening for 
novel antibiotics, bacteriostatics, or modifications thereof or for identifying 
compounds useful to alter expression levels of proteins encoded by a nucleic acid 
molecule. Alternatively, for example, the conformational structure of the RNA 
fragment which mimics the binding site can be employed in rational drug design to 
modify known antibiotics to make them bind more avidly to the target. One such 
methodology is nuclear magnetic resonance (NMR), which is useful to identify 
drug and RNA conformational structures. Still other methods are, for example, the 
drug design methods as described in WO 95/35367, US-A-5,322,933, where the 
crystal structure of the RNA fragment can be deduced and computer programs 
are utilized to design novel binding compounds which can act as antibiotics. 
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The candidate antagonists and inhibitors which can be tested and identified 
according to a method of the invention may be taken from expression libraries, 
e.g., cDNA expression libraries, peptides, proteins, nucleic acids, antibodies, 
small organic compounds, hormones, peptidomimetics, PNAs or the like (Milner, 
Nature Medicine 1 (1995), 879-880; Hupp, Cell 83 (1995), 237-245; Gibbs, Cell 79 
(1994), 193-198 and references cited supra). Furthermore, genes encoding a 
putative regulator of an essential bacterial protein and/or which exert their effects 
up- or downstream said protein may be identified using, for example, insertion 
mutagenesis using, for example, gene targeting vectors known in the art (see, 
e.g., Hayashi, Science 258 (1992), 1350-1353; Fritze and Walden, Gene 
activation by T-DNA tagging. In Methods in Molecular biology 44 (Gartland, 
K.M.A. and Davey, M.R., eds). Totowa: Human Press (1995), 281-294) or 
transposon tagging (Chandlee, Physiologia Plantarum 78 (1990), 105-115). Said 
compounds can also be functional derivatives or analogues of known inhibitors or 
antagonists. Such useful compounds can be for example transacting factors 
which bind an above-described polypeptide. Identification of transacting factors 
can be carried out using standard methods in the art (see, e.g., Sambrook, supra, 
and Ausubel, supra). To determine whether a protein binds to the protein or 
regulatory sequence of the invention, standard native gel-shift analyses can be 
carried out. In order to identify a transacting factor which binds to the protein or 
regulatory sequence of the invention, the protein or regulatory sequence of the 
invention can be used as an affinity reagent in standard protein purification 
methods, or as a probe for screening an expression library. The identification of 
nucleic acid molecules which encode proteins which interact with the polypeptide 
described above can also be achieved, for example, as described in Scofield 
(Science 274 (1996), 2063-2065) by use of the so-called yeast "two-hybrid system"; 
see also the appended example. In this system, e.g., the protein encoded by the 
nucleic acid molecules identified in this invention or a smaller part thereof is linked to 
the DNA-binding domain of the GAL4 transcription factor. A yeast strain expressing 
this fusion gene and comprising a lacZ reporter gene driven by an appropriate 
promoter, which is recognized by the GAL4 or LexA transcription factor, is 
transformed with a library of cDNAs which will express plant genes or fragments 
thereof fused to an activation domain. Thus, if a peptide encoded by one of the 
cDNAs is able to interact with the fusion peptide comprising a peptide of a protein of 
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the invention, the complex is able to direct expression of the reporter gene. In this 
way the nucleic acid molecules and the encoded peptide can be used to identify 
peptides and proteins interacting with the polypeptide described above. It is 
apparent to the person skilled in the art that this and similar systems may then 
further be exploited for the identification of inhibitors or antagonists of the 
polypeptide. 

Once the transacting factor is identified, modulation of its binding to or regulation 
of expression of the polypeptide described above can be pursued, beginning with, 
for example, screening for inhibitors against the binding of the transacting factor to 
the protein specified in accordance with the present invention. Inhibition of 
bacterial growth could then be achieved by applying the transacting factor (or its 
inhibitor). In addition, if the active form of the transacting factor is a dimer, 
dominant-negative mutants of the transacting factor could be made in order to 
inhibit its activity. 

Thus, the present invention also relates to the use of the polypeptide as defined 
above for the identification of antagonists or inhibitors of a polypeptide essential 
for bacterial growth or survival. 

In another embodiment, the present invention relates to a method for designing an 
improved antagonist or inhibitor for the treatment of a bacterial infection or 
disorder or disease related to a bacterial infection comprising the steps of 

(a) identification of the binding site of an antagonist or inhibitor to the 
polypeptide ygbB, yfhC, yacE, ychB, yejD, yrfl, yggJ, yjeE, yiaO, yrdC, 
yhbC, ygbP, ybeY, gcpE, kdtB, pfs, ycaJ, b1808, yeaA, yagF, b1983, yidD, 
yceG and/or yjbC, the sequence of said genes being shown in Fig. 1, or 
identified according to the method of the present invention, by site-directed 
mutagenesis and chimeric polypeptide studies; 

(b) molecular modeling of both the binding site of said antagonist or inhibitor 
and the structure of said polypeptide; and 

(c) modification of said antagonist or inhibitor to improve its binding specificity 
or affinity for the polypeptide. 

Biological assays as described above or other assays such as assays based on 
crystallography or NMR may be employed to assess the specificity or potency of 
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the antagonist or inhibitor wherein the decrease of one or more activities of the 
polypeptide may be used to monitor said specificity or potency. All techniques 
employed in the various steps of the method of the invention are conventional or 
can be derived by the person skilled in the art from conventional techniques 
without further ado. 

For example, identification of the binding site of said antagonist or inhibitor by site- 
directed mutagenesis and chimerical protein studies can be achieved by 
modifications in the (poly)peptide primary sequence that affect the antagonist's or 
inhibitor's affinity; this usually allows to precisely map the binding pocket for the 
drug. Identification of binding sites may be assisted by computer programs. Thus, 
appropriate computer programs can be used for the identification of interactive 
sites of a putative antagonist or inhibitor and the polypeptide of the invention by 
computer assisted searches for complementary structural motifs (Fassina, 
Immunomethods 5 (1994), 114-120). 

As regards step (b), the following protocols may be envisaged: Once the effector 
site for antagonists or inhibitors has been mapped, the precise residues 
interacting with different parts of the antagonists or inhibitors can be identified by 
combination of the information obtained from mutagenesis studies (step (a)) and 
computer simulations of the structure of the binding site provided that the precise 
three-dimensional structure of the antagonists or inhibitors is known (if not, it can 
be predicted by computational simulation). If said antagonist or inhibitor is itself a 
peptide, it can be also mutated to determine which residues interact with others in 
the above-mentioned polypeptide essential for bacterial growth and survival. 
Finally, in step (c) the antagonist or inhibitor can be modified to improve its 
binding affinity or its potency and specificity. If, for instance, there are electrostatic 
interactions between a particular residue of an polypeptide as defined above and 
some region of an antagonist or inhibitor molecule, the overall charge in that 
region can be modified to increase that particular interaction. Furthermore, the 
three-dimensional and/or crystallographic structure of inhibitors or antagonists of 
the polypeptide of the invention can be used for the design of peptidomimetic 
inhibitors or antagonists, e.g. in combination with said polypeptide (Rose, 
Biochemistry 35 (1996), 12933-12944; Rutenber, Bioorg. Med, Chem. 4 (1996), 
1545-1558). 
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Potential antagonists/inhibitors include antisense molecules. Antisense 
technology can be used to control gene expression through antisense DNA or 
through triple-helix formation. Antisense techniques are discussed, for example, in 
Okano, J. Neurochem. 56 (1991), 560; Oligodeoxynucleotides as Antisense 
Inhibitors of Gene Expression, CRC Press, Boca Raton, FL (1988). Triple helix 
formation is discussed in, for instance, Lee, Nucl. Acids Res. 6 (1979), 3073; 
Cooney, Science 241 (1988), 456; and Dervan, Science 251 (1991), 1360. The 
methods are based on binding of a polynucleotide to a complementary DNA or 
RNA. 

For example, the 5' coding portion of a polynucleotide that encodes the mature 
polypeptide as described above may be used to design an antisense RNA 
oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide 
is designed to be complementary to a region of the gene involved in transcription 
thereby preventing transcription and the production of the protein. The antisense 
RNA oligonucleotide hybridizes to the mRNA and blocks translation of the mRNA 
molecule into receptor polypeptide. As indicated, antagonist or inhibitor e.g. 
polyclonal and monoclonal antibody according to the teachings of the present 
invention can be raised according to the methods disclosed in Tartaglia, J. Biol. 
Chem. 267 (1992), 4304-4307; Tartaglia, Cell 73 (1993), 213-216, and PCT 
Application WO 94/09137. 

Antibodies may be prepared by any of a variety of methods using immunogens of 
the polypeptide described above. As indicated, such immunogens include the full 
length polypeptide (which may or may not include the leader sequence) and 
fragments such as the ligand binding domain, the extracellular domain and the 
intracellular domain. These antibodies can be monoclonal antibodies, polyclonal 
antibodies or synthetic antibodies as well as fragments of antibodies, such as 
Fab*, Fv, F(ab') 2 , disulphide-bridged Fv or scFv fragments, etc. Monoclonal 
antibodies can be prepared, for example, by the techniques as originally 
described in Kohler and Milstein, Nature 256 (1975), 495, and Galfre, Meth. 
Enzymol. 73 (1981), 3, which comprise the fusion of mouse myeloma cells to 
spleen cells derived from immunized mammals. Furthermore, antibodies or 
fragments thereof to the aforementioned peptides can be obtained by using 
methods which are described, e.g., in Harlow and Lane "Antibodies, A Laboratory 
Manual", CSH Press, Cold Spring Harbor. 1988. 
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The antagonists or inhibitors isolated by the above methods also serve as lead 
compounds for the development of analog compounds. The analogs should have 
a stabilized electronic configuration and molecular conformation that allows key 
functional groups to be presented to the receptor in substantially the same way as 
the lead compound. In particular the analog compounds have spatial electronic 
properties which are comparable to the binding region, but can be smaller 
molecules than the lead compound, frequently having a molecular weight below 
about 2 kD and preferably below about 1 kD. Identification of analog compounds 
can be performed through use of techniques such as self-consistent field (SCF) 
analysis, configuration interaction (CI) analysis, and normal mode dynamics 
analysis. Computer programs for implementing these techniques are available; 
e.g., Rein, Computer-Assisted Modeling of Receptor-Ligand Interactions (Alan 
Liss, New York, 1989). Methods for the preparation of chemical derivatives and 
analogues are well known to those skilled in the art and are described in, for 
example, Beilstein, Handbook of Organic Chemistry, Springer edition New York 
Inc., 175 Fifth Avenue, New York, N.Y. 10010 U.S.A. and Organic Synthesis, 
Wiley, New York, USA. Furthermore, said derivatives and analogues can be 
tested for their effects according to methods known in the art. Furthermore, 
peptidomimetics and/or computer aided design of appropriate derivatives and 
analogues can be used, for example, according to the methods described above. 

The inhibitor or antagonist identified by the above-described method may prove 
useful as a pesticide, and/or antibiotic. The inhibitors and antagonists of the 
present invention preferably have a specificity at least substantially identical to the 
binding specificity of the natural ligand or binding partner of the polypeptide 
described above. An antagonist or inhibitor can have a binding affinity to said 
polypeptide of at least 10 5 M" 1 , preferably higher than 10 7 M' 1 and advantageously 
up to 10 10 M* 1 . In a preferred embodiment, an inhibitor, e.g. suppressive antibody, 
has an affinity of at least about 10~ 7 M, preferably at least about 10' 9 M and most 
preferably at least about 10* 11 M; and the antagonist has an affinity of less than 
about 10* 7 M, preferably less than about 10* 9 M and most preferably in order of 10' 
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In the case of nucleic acid molecules it is preferred that they have a binding 
affinity to those encoding the amino acid sequences encoded in any one of SEQ 
ID NOS: 16 to 39 of at most 2-, 5- or 10-fold less than an exact complement of 20 
consecutive nucleotides of the above described nucleic acid molecules. 

In another embodiment, the present invention relates to a method for producing a 
therapeutic agent comprising synthesizing the above-described antagonist or 
inhibitor. 

Preferably, the compound identified according to the above described method or 
its analog or derivative is further formulated in a therapeutically active form or in a 
form suitable for the application against bacterial infections or diseases related to 
such an infection. For example, it can be combined with a pharmaceutical^ 
acceptable carrier known in the art. Thus, the present invention also relates to a 
method of producing a (therapeutically effective) composition comprising the 
steps of one of the above described methods of the invention and combining the 
compound obtained or identified in the method of the invention or an analog or 
derivative thereof with a pharmaceutical^ acceptable carrier. 

Also, the present invention relates to a composition comprising the antagonist or 
inhibitor mentioned above. As is evident from the above, the present invention 
generally relates to compositions comprising at least one of the aforementioned 
antagonists or inhibitors, which may be nucleic acid molecules, proteins or 
antibodies. Advantageously, said composition is for use as a medicament, a 
diagnostic means, or a kit. 

The term "composition", as used in accordance with the present invention, 
comprises at least one small molecule or molecule as identified herein above, 
such as a protein, an antigenic fragment of said protein, a fusion protein, a nucleic 
acid molecule and/or an antibody as described above and, optionally, further 
molecules, either alone or in combination, like e.g. molecules which are capable 
of optimizing antigen processing, cytokines, immunoglobulins, lymphokines or 
CpG-containing DNA stretches or, optionally, adjuvants. The composition may be 
in solid, liquid or gaseous form and may be, inter alia, in form of (a) powder(s), (a) 
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tablet(s), (a) solution(s) or (an) aerosol(s). In a preferred embodiment, said 
composition comprises at least two, preferably three, more preferably four, most 
preferably five differentially synthesized proteins. 

The antagonists and inhibitors of the invention appear to function against gene 
products which are essential in several strains or genera of bacteria. Accordingly, 
the above-described antagonists and inhibitors may be used to inhibit the growth 
of a wide spectrum of bacteria. The above described antagonists or inhibitors may 
be used to slow, stop, or reverse bacterial growth. Thus, the present invention 
also relates to a method of producing a therapeutic agent comprising the steps of 
the methods described hereinbefore and synthesizing the antagonist or inhibitor 
obtained or identified as described above or an analog or derivative thereof, 
preferably in an amount sufficient to provide said agent in a therapeutically 
effective amount to a patient. 

Compounds identified by the above methods or analogs are formulated for 
therapeutic use as pharmaceutical compositions. The compositions can also 
include, depending on the formulation desired, pharmaceutical^ acceptable, 
usually sterile, non-toxic carriers or diluents, which are defined as vehicles 
commonly used to formulate pharmaceutical compositions for animal or human 
administration. The diluent is selected so as not to affect the biological activity of 
the combination. Examples of such diluents are distilled water, physiological 
saline, Ringer's solutions, dextrose solution, and Hank's solution. In addition, the 
pharmaceutical composition or formulation may also include other carriers, 
adjuvants, or nontoxic, nontherapeutic, nonimmunogenic stabilizers and the like. 
A therapeutically effective dose refers to that amount of protein or its antibodies, 
antagonists, or inhibitors which ameliorate the symptoms or condition. 
Therapeutic efficacy and toxicity of such compounds can be determined by 
standard pharmaceutical procedures in cell cultures or experimental animals, e.g., 
ED50 (the dose therapeutically effective in 50% of the population) and LD50 (the 
dose lethal to 50% of the population). The dose ratio between therapeutic and 
toxic effects is the therapeutic index, and it can be expressed as the ratio, 
LD50/ED50. 

Compositions comprising such carriers can be formulated by well known 
conventional methods. These pharmaceutical compositions can be administered 
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to the subject at a suitable dose. Administration of the suitable compositions may 
be effected by different ways, e.g., by intravenous, intraperitoneal, subcutaneous, 
intramuscular, topical, intradermal, intranasal or intrabronchial administration. The 
dosage regimen will be determined by the attending physician and clinical factors. 
As is well known in the medical arts, dosages for any one patient depends upon 
many factors, including the patient's size, body surface area, age, the particular 
compound to be administered, sex, time and route of administration, general 
health, and other drugs being administered concurrently. Proteinaceous 
pharmaceutical^ active matter may be present in amounts between 1 ng and 10 
mg per dose; however, doses below or above this exemplary range are 
envisioned, especially considering the aforementioned factors. Administration of 
the suitable compositions may be effected by different ways, e.g., by intravenous, 
intraperitoneal, subcutaneous, intramuscular, topical or intradermal administration. 
If the regimen is a continuous infusion, it should also be in the range of 1 pg to 10 
mg units per kilogram of body weight per minute, respectively. Progress can be 
monitored by periodic assessment. The compositions of the invention may be 
administered locally or systemically. Administration will generally be parenterally, 
e.g., intravenously. The compositions of the invention may also be administered 
directly to the target site, e.g., by biolistic delivery to an internal or external target 
site or by catheter to a site in an artery. Preparations for parenteral administration 
include sterile aqueous or non-aqueous solutions, suspensions, and emulsions. 
Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, 
vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. 
Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or 
suspensions, including saline and buffered media. Parenteral vehicles include 
sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, 
lactated Ringer's, or fixed oils. Intravenous vehicles include fluid and nutrient 
replenishers, electrolyte replenishers (such as those based on Ringer's dextrose), 
and the like. Preservatives and other additives may also be present such as, for 
example, antimicrobials, anti-oxidants, chelating agents, and inert gases and the 
like. Furthermore, the pharmaceutical composition of the invention may comprise 
further agents such as interleukins, interferons and/or CpG-containing DNA 
stretches, depending on the intended use of the pharmaceutical composition. 
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In another embodiment, the present invention relates to a kit comprising at least 
one of the aforementioned antagonists or inhibitors of the invention. The kit of the 
invention as well as the composition may in a preferred embodiment contain 
further ingredients such as selection markers, antibiotics, cytokines and 
components for simplifying or supporting the treatment of bacterial infections or 
disorders or diseases related to bacterial infections. The kit of the invention may 
advantageously be used for carrying out the method of the invention and could 
be, inter alia, employed in a variety of applications referred to herein, e.g., in the 
diagnostic field or as research tool. The parts of the kit of the invention can be 
packaged individually in vials or in combination in containers or multicontainer 
units. Manufacture of the kit follows preferably standard procedures which are 
known to the person skilled in the art. The kit or its ingredients according to the 
invention can be used in antibacterial therapies, for example, for any of the above 
described methods for detecting further inhibitors and antagonists essential for 
bacterial growth and survival. The kit of the invention and its ingredients are 
expected to be very useful for the healing and protection of animals and humans 
suffering from a bacterial infection. 

The present invention also relates to a method for treating or preventing bacterial 
infections or diseases or disorders related to bacterial infections comprising the 
step of administering to a subject in need thereof an antagonist or inhibitor 
identified herein above, optionally comprised in a pharmaceutical composition of 
the invention. 

In another embodiment the present invention relates to the use of a polypeptide 
encoded by the gene as identified above or a fragment, derivative or ortholog 
thereof or of any of said genes for the identification of an antagonist or inhibitor of 
said polypeptide fragment, derivate or ortholog or said gene. 

In a further embodiment the present invention relates to the use of said 
polypeptide, the therapeutic agent produced according to the invention, the 
antagonist or inhibitor obtained or identified by the method or use according to the 
invention for the preparation of a pharmaceutical composition for the treatment of 
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(a) bacterial infection(s), disorder(s) and/or disease(s) related to bacterial 
infections. 

In another embodiment the present invention relates to a method for treating or 
preventing bacterial infections or diseases or disorders related to bacterial 
infections comprising the step of administering to a subject in need thereof an 
antagonist or inhibitor identified herein above, optionally comprised in the 
pharmaceutical composition according to the present invention. 

In a further embodiment the present invention relates to the use of the above- 
described polypeptide, a fragment, derivative or ortholog thereof or of any of said 
genes for screening for polypeptides interacting with said polypeptide using 
protein-protein interaction technologies, and/or for validating such interaction as 
being essential for bacterial survival and/or for screening for antagonists or 
inhibitors of such interaction. 

In a further embodiment the present invention relates to the use of the above- 
described polypeptide, a fragment, derivative or ortholog thereof or of any of said 
genes for screening of polypeptide for polypeptide binding to said polypeptide, 
and/or for validating the peptides binding to said polypeptide as preventing growth 
of bacteria or being lethal to bacteria upon expression of said polypeptides in said 
bacteria, and/or for screening for small molecules competitively displacing said 
peptides. 

In another embodiment the present invention relates to the use of a conditional 
mutant of a gene as described above or a fragment, derivative or ortholog thereof 
or of surrogate ligands against said gene expressed in bacteria to induce a lethal 
phenotype in bacteria and/or for the analysis of said bacteria for surrogate 
markers by comparison of RNA or protein profiles in said bacteria with RNA or 
protein profiles in wild type bacteria, and/or the use of said surrogate markers for 
the identification of antagonists of the essential function of said gene. 
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In another embodiment the present invention relates to a method for identifying or 
isolating a surrogate marker comprising the steps as described in the above- 
recited method of the present invention. 

In a further embodiment the present invention relates to a method for identifying 
or isolating a surrogate marker comprising the steps of 

(a) inducing a lethal phenotype in bacteria representing a conditional mutant of 
a gene selected from the group consisting of ygbB t yfhC, yacE, ychB, yejD, 
yrfl, yggJ, yjeE, yiaO, yrdC, yhbC, ygbP, ybeY, gcpE, kdtB, pfs, ycaJ, 
b1808, yeaA, yagF, b1983, yidD, yceG and/or yjbC; and 

(b) analysing said bacteria comparing the RNA or protein profile of said 
bacteria with wild type bacteria. 

The invention also relates to the above recited genes and polypeptides and 
fragments, derivatives and orthologs thereof. 

These and other embodiments are disclosed and encompassed by the description 
and examples of the present invention. Further literature concerning any one of 
the methods, uses and compounds to be employed in accordance with the 
present invention may be retrieved from public libraries, using for example 
electronic devices. For example the public database "Medline" may be utilized 
which is available on the Internet, for example under 
http://www.ncbi.nlm.nih.gov/PubMed/medline.html. Further databases and 
addresses, such as http://www.ncbi.nlm.nih.gov/, 

http://www.infobiogen.fr/, http://www.fmi.ch/biology/research_tools.html, http://ww 
w.tigr.org/, are known to the person skilled in the art and can also be obtained 
using, e.g., http://www.lycos.com. An overview of patent information in 
biotechnology and a survey of relevant sources of patent information useful for 
retrospective searching and for current awareness is given in Berks, TIBTECH 12 
(1994), 352-364. 

The present invention is further illustrated by reference to the following non- 
limiting examples. 
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Unless stated otherwise in the examples, all recombinant DNA techniques are 
performed according to protocols as described in Sambrook et al. (1989), 
Molecular Cloning : A Laboratory Manual. Cold Spring Harbor Laboratory Press, 
NY or in Volumes 1 and 2 of Ausubel et al. (1994), Current Protocols in Molecular 
Biology, Current Protocols. Standard materials and methods for plant molecular 
work are described in Plant Molecular Biology Labfase (1993) by R.D.D. Croy, 
jointly published by BIOS Scientific Publications Ltd. (UK) and Blackwell Scientific 
Publications (UK). 



Brief description of the figures 

Figure 1: Sequences of the essential bacterial genes identified according to 
the method described in the examples 

Figure 2: PCR strategy and the position of primers used 

Figure 3: Sequence comparison table of essential E.coli genes with proposed 
orthologs from various bacteria. Unfinished genomes are indicated 
by asterisk. Complete genomes were analysed using BlastP2. 
Unfinished genomes were analysed with TBIastN. Orthologous 
sequences can be accessed at the respective WWW links as 
indicated in the footnotes. 

Figure 4: Multiple Sequence Alignment (MSA) of E. coli gene ygbB with 
orthologs in 5 different bacterial organisms including homology 
score. Similar MSA with similar results have been created for all 22 
essential bacterial genes. 

Example 1 



An automated BLASTP-based genome comparisons to identify E. coli FUN genes 
resulted in the following list of 65 candidate genes which are conserved between 
E. coli. B. subtilis, H. influenzae, H. pylori, M. tuberculosis, Ch. trachomatis. B. 
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burgdorferi, T. pallidum. S. pneumoniae, S. aureus. E. faecalis, P. aeruginosa, B. 
pertussis and which were further analysed: 
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a : double mutants were created when the respective genes were paralogues in 
E.coli 



Creating in-frame deletions of £. coli genes 

The subsequent description of the construction of deletion mutants was 
carried out essentially equal for these 77 candidate genes. Particular details will 
exemplarily be described for one gene which gave rise to be essential (yfhC) and 
one which was non-essential (yggV). 



WO 00/61793 PCT/EP00/03135 

26 

1) Principle of th PCR-procedure and primer-design for in frame d I tions: 

Unless an overlapping ORF exists, primers dgenX2 and dgenX3 are 
designed to delete the entire ORF from ATG to STOP, e.g.: 
ATGgttataaatttggagtgtgaaggttattgcgtgTAA (SEQ ID NO: 1) (see figure). The Sp- 
ends of primers dgenXI and dgenX4 contain random nucleotides followed 
preferably by a BamHI site (dgenXI) or a Sail site (dgenX4) for cloning into 
plasmid pK03 (Link et al (1997), J Bac 179: 6228-6237). In most mutants, primers 
dgenX2 and dgenX3 contain a 33 bp tag sequence called "Church-tag". 

Church-tag forward direction: 5 -gttataaatttggagtgtgaaggttattgcgtg-3' (SEQ ID 
NO: 2) 

Church-tag reverse direction: 5'-cacgcaataaccttcacactccaaatttataac-3' (SEQ ID NO: 
3) 

This tag is used for a subsequent PCR in which the 5'- and 3'- flanking DNA- 
fragments of the deletion construct are assembled. 

In the few constructs lacking the "Church-tag", the primers dgenX2 and dgenX3 
carry at their 5'-ends 5 random nucleotides followed by a restriction site 
(preferably EcoRI) which by its positioning creates the in frame deletion. 

Oligos cgenXI and cgenX2 are used for the verification of the chromosomal 
situation (wild type or deletion) after the replacement procedure (Fig. 2). 

Primers for the respective candidate genes were designed as follows: 
dyfhCI: S'-GATCGGATCCAAATTCCAGTTAGCCATGATGCGGTC-S' 
(SEQ ID NO: 4) 

dyfhC2: 5'-CACGCAATAACCTTCACACTCCAAATTTATAACCATTATA 

CACGGACGCTATGC-3* (SEQ ID NO: 5) 
dyfhC3: 5'- 

GTTATAAATTTGGAGTGTGAAGGTTATTGCGTGACGGATTAATT 

TTGTTTCTCTT-3' (SEQ ID NO: 6) 
dyfhC4: S'-GATCGTCGACGCGCTCGATATCACCGATGAACAACCG-S* 

(SEQ ID NO: 7) 

cyfhCI : 5'-C AATCC G CTG CTTT ATTTCTGTC AG-3' (SEQ ID NO: 8) 
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cyfhC2: 5-TTATAACGAAATCAACGGGAAACCT-3' (SEQ ID NO: 9) 



dyggVI : 5'-GATCGGATCCCTCTAAAAAATAAGGAATTAAAGG-3' 
(SEQ ID NO: 10) 

dyggV2: 5'-CACGCAATAACCTTCACACTCCAAATTTATAACCATAGGATAC 

CTAATTAATTAAC-3' (SEQ ID NO: 1 1) 
dyggV3: 5'-GTTATAAATTTGGAGTGTGAAGGTTATTGCGTGAAGAGCGCC 

ATTTCCCACCGT-3' (SEQ ID NO: 12) 
dyggV4: 5'-G ATCGTCG ACTCATATTG CTG ATAACCC G CTG CGGT-3' 

(SEQ ID NO: 13) 

cyggVI : 5'-GTTGACGGCCAGGCCAACAGTCAT-3' (SEQ ID NO: 14) 
cyggV2: 5'-ATAACCCTGGGCAATCGCCTCG-3' (SEQ ID NO: 15) 



Example 2 

Construction of the DNA-fragments comprising the deletion 

The 5'- and the 3'-flanking DNA fragments are PCR amplified in a total 

volume of 50 ul as follows: 

Chromosomal DNA from E. coli strain MG1655 (100 ng/^tl): 

final cone: 1 ng/ul 
10*Pwo-buffer final cone: 1x 

dgenX1/3 (10 ^M) final cone: 500 nM 

dgenX2 (4) (10 uM) final cone: 500 nM 
Pwo-Polymerase final cone: 5 U/1 00 ul 

dNTPs (25 mM) final cone: 250 uM 

H2O to adjust volume to 50 ul 



PCR conditions: 

4' 94 °C 
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30 cycles: 30" 94 °C. 30" 44 °C, 1' 72 °C . 
5' 72 °C 

The PGR products are then purified with the High Pure PCR Purification Kit 
(Boehringer) to remove salts and enzyme (elute in 50 ul H 2 0). Alternatively, if 
PCR products contain prominent impurities, the respective fragment must be 
purified by agarose gel extraction (Gene Clean, Dianova) before the fragment 
assembly. 

Assembly PCR 

Equal amounts of 5'- and 3'-fragment are applied as template DNA. In general a 
volume applied for gel electrophoresis giving an intense band is o.k. The total 
reaction volume is 100 pi. For the assembly the "outer" primers dgenXI and 
dgenX4 were used. 

S'-Fragment 
3'-Fragment 
10*Pwo-buffer 
dgenXI (10 ^M) 
dgenX4(10^M) 
Pwo-Pol (Boehringer) 
dNTPs (25mM) 
H 2 0 

PCR conditions: 

4' 94 °C 

10 cycles: 30" 94 °C, 30" 44 °C, V 72 °C 
25 cycles: 30" 94 °C. 30" 44 °C, 3' 72 °C 
5' 72 °C 

The success of the PCR is checked by agarose gel electrophoresis. The 
assembled PCR product is purified with the High Pure PCR Purification Kit and 
the complete eluate of 50 ul is over-night digested with BamHI and Sail in a 
volume of 60 ul. After gel electrophoresis the digested product is purified with 



approx. 10 ng 
approx. 10 ng 
final cone: 1x 
final cone: 500 nM 
final cone: 500 nM 
final cone: 5 U/100 
final cone: 250 pM 
add to 100 ul 



(50 pmol/100 pi) 
(50 pmol/100 pi) 
Ml 
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Gene Clean (Dianova) to remove small oligonucleotides quantitatively (elution 
volume: 25 pi). 

Cloning into vector pK03: 

Next, the fragment is ligated into the vector pK03 (cut with BamHI and Sail) in a 
10-20 pi reaction (T4-DNA ligase) for 2 hours at room temperature. 

Transformation into DH5: 

One half of the ligation mix is transformed into chemically competent E. coli DH5ot 
and clones are purified once (usually 8 clones are sufficient). 

Verification of deletion constructs: 

1) 8 clones are characterized by colony-PCR with vector pK03-specific primers 
(pK03-B1 and pK03-S1). 

2) Clones with the correct size of insert are double-checked by cotony-PCR with 
gene specific primers (dgenXI and dgenX4). 



Reaction mixture for 25 pi reaction volume: 



template (colony) 
10*Taq-buffer 
5*Q-solution 
pK03-B1/dgenX1 (100 pM) 
pK03-S1/dgenX4 (100 pM) 
Taq-Pol (QIAgen) 
dNTPs (25 mM) 
H 2 0 



1 pi of 1 colony resuspended in 20 pi H 2 0 
final cone: 1x 
final cone: 1x 

final cone: 1 pM (50 pmol/100 pi) 
final cone: 1 pM (50 pmol/100 pi) 

final cone: 2 U/25 pi 

final cone: 250 pM 

15.35 pi 



PCR conditions: 

4' 94 °C 

25 cycles: 30 M 94 °C, 30" 50 °C, 2' 65 °C 
5' 65 °C 
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3) Plasmid-DNA from 4 ml over-night culture is prepared using a QIAgen 
Miniprep Kit and a double restriction analysis with BamHI/Sall and 
EcoRI/Hindlll is performed to verify the clones. 

Protocol referring to the construction of assembly products by a 
restriction site: 

The 5'- and the 3 T -fragments are PCR amplified as described above. The PCR 
products are purified with the High Pure PCR Purification Kit (Boehringer) to 
remove salts and enzyme and 5 to 10 jal are digested over night using the 
restriction site creating the deletion (primers 2 and 3; mostly EcoRI) in a total 
volume of 30 pi. The restriction products are again purified with the High Pure 
PCR Purification Kit to remove nucleotides, salts and enzyme. (Alternatively: 
Following preparative agarose gel electrophoresis the cut fragments are isolated 
using Gene Clean (Dianova) and eluted in a volume of 25 ^l. The cut fragments 
(3-6 jil each) are ligated in a volume of 10-15 y\ using T4-DNA ligase for 2 hours 
at room temperature. 5 \x\ of this ligation mix is directly used as a template for a 
second PCR. In this PCR, the assembled fragments are amplified using primers 
dgenXI and dgenX4. The reaction is set up as described above with two 
exceptions: 1) The total reaction volume is 100 pi and 2) the extension step at 72 
°C lasts 3'. 

Example 3 

The chromosomal exchange strategy 

(Link et al (1997), J Bac 179: 6228-6237) 

Cointegration: 

Cointegration = integration of a plasmid into the chromosome by a recombination 
event 

The pK03 derivative is transformed into MG1655 or any recA+ strain 
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Day 1 

The strain is grown at 30 °C in LB containing 20 ug/ml chloramphenicol (LB- 
Cam20) to an OD 6 oo of -1.0. Afterwards, perform 10-fold serial dilutions in the 
same medium (down to 10' 7 ). For the following plating use prewarmed LB-Cam20 
agar plates. Plate 100 jil of dilutions 10" 4 and 10* 5 for incubation at 44 °C and 100 
nl of dilutions 10" 5 and 10* 7 for incubation at 30 °C. 

Day 2 

Following incubation at the respective temperature, determine the factor 
c.f.u.44 °C/c.f.u.30 °C (c.f.u. = colony forming units). This factor for pK03 without 
insert is in the range 1*10^ to 5M0" 4 and should be significantly larger in the case 
of successful cointegration. Purify 8 randomly chosen clones from the 44 °C plate 
twice on LB-Cam20 agar plates at 44 °C (during Day 2 and over night to Day 3). 
Optionally, confirm the clones for their identity as cointegrates by colony-PCR. 

Resolution and counter-selection: 

Resolution = resolution of the cointegrate resulting in a self replicative plasmid by 
a second recombination event 

Counter-selection = selection against the presence of plasmid in the cell 
Day 3 

Pool single colonies from each of the 8 cointegrates in 100 jil LB and use this 
suspension as an inoculum for 1 0 ml LB+5 %sucrose. After growth at 30 °C (8 to 
10 hours during a day is sufficient) 10-fold serial dilutions are performed and 100 
|il of dilutions 10* 4 , 10~ 5 , and 10* 6 are plated onto LB agar+5 % sucrose and grown 
over night at 30 °C. 

Day 4 

50 single colonies are replica streaked on LB+Cam20 and LB+5 % sucrose to test 
for the loss of plasmid. 
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Example 4 

Testing for essentiality of FUN genes of E. coli and interpretation of the 

results 

Day 5 

The clones sensitive to chloramphenicol are then tested for their genotype (wild 
type versus in-frame deletion) by colony-PCR using primers cgenXI and cgenX2 
(10-48 clones). 

In the case of the gene yfhC out of 48 clones tested only wild type situation on the 
chromosome could be detected. 

In the case of the gene yggV out of 48 clones 16 (= 33 %) revealed a PCR 
product with a size indicative for the deletion situation on the chromosome. 

Are 48 clones revealing no mutant enough to claim a gene as essential? This 
question can be answered by asking for the number of clones that have to be 
tested to get a confidence of e.g. 99 % that really no mutants are present in an 
infinite number of clones. Provided a hypothesis Ho means that only the wild type 
genotype is viable and hypothesis Hi means that a fraction (1-x) of mutants is 
allowed to occur together with the wild type (x) among a population of clones (x + 
(1-x)), then the probability to make the wrong decision (decision for H 0 whereas 
is true) can be calculated as 

(1) x n /(1+x n ) 

where x is the fraction of wild type clones and n is the number of clones tested. 
The confidence niveau a to make the wrong decision (error probability) is given by 

(2) a>x n /(1+x n ) 
thereby resulting in 

(3) n>ln(a/(1-a))/ln(x) 

for the number of clones that have to be tested to prove or disprove hypothesis 
H 0 . 
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If the average probability for obtaining wild type clones (x) in a replacement 
experiment is 70 % (experimentally determined for 43 non-essential genes out of 
65 candidate genes), then, after testing of 26 clones which reveal a wild type 
genotype an uncertainty of 0.01 % error probability (a) remains that the claiming 
of a gene as essential could be wrong. Even if the rate of obtaining wild types (x) 
is set to 85 % (a value which occurs with a frequency of 10 % for replacement 
experiments with non-essential genes), then, by testing 32 clones (which was 
performed in every experiment giving rise to an essential gene) an error 
probability of only 0.6 % remains to chose the wrong hypothesis. 



Examle 5 



List of essential FUN genes obtained 

By the described method the following 24 genes were obtained which gave no 
deletion genotype and are therefore claimed to be essential: 



£. coli 


gene name GenBank# 


ygbB 


g1789103 


yfhC 


g 1788911 


yacE 


g 1786292 


ychB 


g 1787459 


yejD 


g 1788510 


yrfl 


g 1789804 


yggJ 


g 178931 5 


yjeE 


g 1790610 


yiaO 


g 1790004 


yrdC 


g2367210 


yhbC 


g1 789561 


ygbP 


g1 789104 


ybeY 


g 1786880 
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gcpE 


g 1788863 


kdtB 


g1 790065 


pfs 


g1 786354 


ycaJ 


g1 7871 19 


b1808 


g1788110 


yeaA 


g 1788077 


yagF 


g 1786464 


b1983 


g 1788294 


yidD 


g140861 


yceG 


g 1787339 


yjbC 


g396357 
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CLAIMS 

1. A method for identifying an antagonist or inhibitor of the expression of a 
gene encoding a polypeptide essential for bacterial growth or survival 
wherein said gene is selected from the group consisting of ygbB t yfhC, 
yacE, ychB, yejD, yrfl, yggj, yjeE, yiaO, yrdC, yhbC, ygbP, ybeY, gcpE, 
kdtB, pfs, ycaJ, b1808, yeaA, yagF, b1983, yidD, yceG and/or yjbC, the 
sequence of said genes being shown in Fig. 1, or a fragment, derivative or 
ortholog thereof, said method comprising the steps of 

(a) testing a candidate antagonist or inhibitor or a sample comprising 
a plurality of said candidate antagonists or inhibitors for the 
inhibition or reduction of transcription of said gene or a fragment 
or derivative thereof; or 

(b) testing a candidate antagonist or inhibitor or a sample comprising 
a plurality of said candidate antagonists or inhibitors for the 
inhibition or reduction of translation of mRNA transcribed from 
said gene or a fragment or derivative thereof; and 

(c) identifying an antagonist or inhibitor or a sample comprising a 
plurality of said candidate antagonists or inhibitors that tests 
positive in step (a) and/or (b). 

2. A method for testing a candidate antagonist or inhibitor of a polypeptide or 
a mRNA essential for bacterial growth or survival encoded by a gene 
selected from the group consisting of ygbB, yfhC, yacE, ychB, yejD, yrfl, 
yggJ. yjeE, yiaO, yrdC, yhbC, ygbP, ybeY, gcpE, kdtB, pfs, ycaJ, b1808, 
yeaA, yagF, b1983, yidD, yceG and/or yjbC, the sequence of said genes 
being shown in Fig. 1, or a fragment, derivative or ortholog thereof 
comprising the steps of 

(a) contacting a bacterial cell with a candidate antagonist or inhibitor 
or a sample comprising a plurality of said candidate antagonists 
or inhibitors; and 

(b) testing whether said contacting leads to cell growth inhibition 
and/or cell death. 
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3. A method for testing a candidate antagonist or inhibitor of the function of a 
gene essential for bacterial growth or survival wherein said gene is 
selected from the group consisting of ygbB, yfhC, yacE, ychB, yejD, yrfl, 
yggJ. yjeE, yiaO, yrdC, yhbC, ygbP, ybeY, gcpE, kdtB, pfs, ycaJ, b1808, 
yeaA, yagF, b1983, yidD t yceG and/or yjbC, the sequence of said genes 
being shown in Fig. 1, or a fragment, derivative or ortholog thereof, 
comprising the steps of 

(a) contacting a bacterial cell comprising said gene with a candidate 
antagonist or inhibitor or a sample comprising a plurality of said 
candidate antagonists or inhibitors; and 

(b) testing whether said contacting leads to cell growth inhibition 
and/or ceil death. 

4. The method of any one of claims 1 to 3 further comprising identifying an 
antagonist or inhibitor, optionally from said sample of candidate 
antagonists or inhibitors. 

5. The method of any one of claims 1 to 4 wherein said inhibitor or 
antagonist is further improved by peptidomimetics or by applying phage 
display or combinatorial library technique step(s). 

6. A method for designing an improved antagonist or inhibitor for the 
treatment of a bacterial infection or disorder or disease related to a 
bacterial infection comprising the steps 

(a) identification of the binding site of an antagonist or inhibitor to the 
polypeptide ygbB, yfhC, yacE, ychB, yejD, yrfl, yggJ, yjeE, yiaO, 
yrdC, yhbC, ygbP, ybeY, gcpE, kdtB, pfs, ycaJ, b1808, yeaA, 
yagF, b1983, yidD, yceG and/or yjbC, the sequence of said genes 
being shown in Fig. 1 , or obtained by or identified by the method 
of any one of claims 1 to 5 by site-directed mutagenesis and 
chimeric polypeptide studies; 

(b) molecular modeling of both the binding site of said antagonist or 
inhibitor and the structure of said polypeptide; and 
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(c) modification of said antagonist or inhibitor to improve its binding 
specificity or affinity for the polypeptide. 

7. An antagonist or inhibitor of the activity of a polypeptide encoded by a 
gene selected from the group consisting of ygbB, yfhC, yacE, ychB, yejD, 
yrfl, yggJ, yjeE, yiaO, yrdC, yhbC, ygbP, ybeY, gcpE t kdtB, pfs, ycaJ, 
b1808 t yeaA, yagF, b1983, yidD, yceG and/or yjbC, the sequence of said 
genes being shown in Fig. 1 , or fragment, derivative or ortholog thereof or 
of the expression of a gene encoding said polypeptide or said fragment, 
derivative or ortholog or obtained by or identified by the method of any 
one of claims 1 to 6. 

8. A method for producing a therapeutic agent comprising synthesizing the 
antagonist or inhibitor identified, tested or designed according to the 
method of any one of claims 1 to 6 or the antagonist or inhibitor of claim 7 
or an analog or derivative thereof. 

9. A method for producing a composition comprising the steps of the method 
of any one of claims 1 to 6 or synthesizing the antagonist or inhibitor of 
claim 7 and formulating said inhibitor or antagonist in a pharmaceutical^ • 
acceptable form. 

10. A composition comprising an antagonist or inhibitor of claim 7, the 
therapeutic agent produced by the method of claim 8 or the antagonist or 
inhibitor obtained by or identified in the method of any one of claims 1 to 6 
or produced according to claim 9 and optionally a pharmaceutical^ 
acceptable carrier. 

1 1 . The composition of claim 10 which is a pharmaceutical composition. 

12. The composition of claim 10 which is a kit. 

13. The composition of any one of claims 10 to 12 further comprising an 
antibiotic and/or cytokine. 
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14. Use of a polypeptide encoded by a gene selected from the group 
consisting of ygbB, yfhC, yacE, ychB, yejD, yrfl, yggJ, yjeE, yiaO, yrdC, 
yhbC, ygbP, ybeY, gcpE, kdtB, pfs, ycaJ, b1808, yeaA, yagF, b1983 f 
yidD, yceG and/or yjbC ( the sequence of said genes being shown in Fig. 
1 , or a fragment, derivative or ortholog thereof or of any of said genes for 
the identification of an antagonist or inhibitor of the activity of said 
polypeptide or said fragment, derivative or ortholog or of the expression of 
a gene encoding said polypeptide or said fragment, derivative or ortholog. 

15. Use of an antagonist or inhibitor of claim 7, the therapeutic agent 
produced by the method of claim 8 or the antagonist or inhibitor obtained 
by or identified in the method of any one of claims 1 to 6 or produced 
according to claim 9 or identified by the use of any of the claims for the 
preparation of a pharmaceutical composition for the treatment of (a) 
bacterial infection(s), disorder(s) and/or disease(s) related to bacterial 
infections. 

16. A method for treating or preventing bacterial infections or diseases or 
disorders related to bacterial infections comprising the step of 
administering to a subject in need thereof the antagonist or inhibitor 
obtained by or identified in the method of any one of claims 1 to 6 or 
produced according to claim 9 optionally comprised in the pharmaceutical 
composition according to claim 11. 

17. Use of a polypeptide encoded by a gene selected from the group 
consisting of ygbB, yfhC, yacE, ychB, yejD, yrfl, yggJ, yjeE, yiaO, yrdC, 
yhbC, ygbP, ybeY, gcpE, kdtB, pfs, ycaJ, b1808, yeaA, yagF, b1983, 
yidD, yceG and/or yjbC, the sequence of said genes being shown in Fig. 
1, or a fragment, derivative or ortholog thereof or any of said genes for 
screening for polypeptides interacting with said polypeptide using protein- 
protein interaction technologies, and/or for validating such interaction as 
being essential for bacterial survival and/or for screening for antagonists 
or inhibitors of such interaction. 
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18. Use of a polypeptide encoded by a gene selected from the group 
consisting of ygbB, yfhC, yacE. ychB, yejD, yrfl, yggJ f yjeE, yiaO, yrdC, 
yhbC, ygbP. ybeY, gcpE, kdtB, pfs r ycaJ, b1808 t yeaA, yagF, b1983, 
yidD, yceG and/or yjbC, the sequence of said genes being shown in Fig. 
1, or a fragment, derivative or ortholog thereof or any of said genes for 
screening of polypeptide for polypeptide binding to said polypeptide, 
and/or for validating the peptides binding to said polypeptide as 
preventing growth of bacteria or being lethal to bacteria upon expression 
of said polypeptides in said bacteria, and/or for screening for small 
molecules competitively displacing said peptides. 

19. Use of conditional mutants in a gene selected from the group consisting of 
ygbB, yfhC, yacE, ychB, yejD, yrfl, yggj, yjeE, yiaO. yrdC, yhbC, ygbP, 
ybeY, gcpE, kdtB, pfs, yea J, b1808, yeaA, yagF, b1983, yidD, yceG 
and/or yjbC, the sequence of said genes being shown in Fig. 1, or a 
fragment, derivative or ortholog thereof or of surrogate ligands against 
said gene expressed in bacteria to induce a lethal phenotype in bacteria 
and/or for the analysis of said bacteria for surrogate markers by 
comparison of RNA or protein profiles in said bacteria with RNA or protein 
profiles in wild type bacteria, and/or the use of said surrogate markers for 
the identification of antagonists of the essential function of said gene. 

20. A method for identifying or isolating a surrogate marker comprising the 
steps as described in claim 19. 

21. A method for identifying or isolating a surrogate marker comprising the 
steps of 

(a) inducing a lethal phenotype in bacteria containing a conditional 
mutant of a gene selected from the group consisting of ygbB, yfhC, 
yacE, ychB, yejD, yrfl, yggj, yjeE, yiaO, yrdC, yhbC, ygbP, ybeY, 
gcpE, kdtB, pfs, ycaJ, b1808, yeaA, yagF, b1983, yidD, yceG and/or 
yjbC, the sequence of said genes being shown in Fig. 1 ; and 
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(b) analysing said bacteria comparing the RNA or protein profile of said 
bacteria with wild type bacteria. 
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Fig. 1 

1) ygbB 

atgcgaattggacacggttttgacgtacatgcctttggcggtgaaggcccaattatcattggtggcgtacgcattccttacgaaaa 

aggattgctggcgcattctgatggcgacgtggcgctccatgcgttgaccgatgcattgcttggcgcggcggcgctgggggatat 

cggcaagctgttcccggataccgatccggcatttaaaggtgccgatagccgcgagctgctacgcgaagcctggcgtcgtattc 

aggcgaagggttatacccttggcaacgtcgatgtcactatcatcgctcaggcaccgaagatgttgccgcacattccacaaatg 

cgcgtgtttattgccgaagatctcggctgccatatggatgatgttaacgtgaaagccactactacggaaaaactgggatttaccg 

gacgtggggaagggattgcctgtgaagcggtggcgctactcattaaggcaacaaaatga (SEQ ID NO: 16) 

2) yfhC 

atgcgccgcgcttttataaccggagttttctttttgtctgaagtcgaatttagccacgaatactggatgcgtcacgcgctgacgctg 

gcgaaacgtgcctgggatgagcgggaagtgccggtcggcgcggtattagtgcataacaatcgggtaatcggcgaaggctg 

gaaccgcccgattggtcgccatgatcccaccgcacatgcagaaatcatggccctgcggcagggtggtctggtgatgcaaaat 

tatcgtctgatcgacgccacgttgtatgtcacgcttgaaccatgtgtaatgtgtgccggagcgatgatccacagtcgcattggtcg 

cgtggtctttggtgcgcgtgacgcgaaaactggcgctgcgggatctttaatggatgtgctgcatcatccgggtatgaatcaccga 

gtggaaattacggaaggaatactggcggatgagtgcgcggcgttgctcagtgacttctttcgcatgcgccgccaggaaattaa 

agcgcagaaaaaagcgcaatcctcgacggattaa (SEQ ID NO: 17) 

3) yacE 

atgaggtatatagttgccttaacgggaggcattggcagtggcaagagtaccgttgccaatgcgtttgctgatctcggaattaacg 
tcattgatgccgatattattgcgcgtcaggtggttgaaccaggtgcacctgcgctacatgccattgctgatc^cttt^ 
tgattgctgctgatggaacattgcagcgccgggccttgcgcgagcggatcttcgccaacccggaagagaaaaactggcttaa 
cgccctgctgcatccgctgattcagcaagagac^caacaccag 

attgctggtagaaaactcactgtataaaaaagcgaatcgagtgcttgtggtggatgtcagcccagaaacgcaacttaagcgc 
accatgcagcgcgatgatgtaactcgcgagcatgtcgaacaaatccttgctgctcaggcaacgcgcgaagcccgccttgccg 
tggcagatgacgtcattgataataacggcgcaccggatgctatcgcatcggatgttgccc^cctgcacgcacactatttgcagc 
ttgcgtcgcagtttgtctcacaggaaaaaccgtaa (SEQ ID NO: 18) 

4) ychB 

atgc^gacacagtggccctctccggcaaaacttaatctgttmatacattaccggtcagcgtgcggatggttaccacac^ 
aaacgctgmcagmcttgattacggcgacaccatcagrattgagct^ 

aaggcgtggaacatgaagataacctgatcgttcgcgcagcgcgattgttgatgaaaactgcggcagacagcgggcgtcttcc 

gacgggaagcggtgcgaatatcagcattgacaagcgtttgccgatgggcggcggtctcggcggtggttcatccaatgccgcg 

acggtcctggtggcattaaatcatctctggcaatgcgggctaagcatggatgagctggcggaaatggggctgacgctgggcg 

cagatgttcctgtctttgttcgggggcatgccgcgtttgccgaaggcgttggtgaaatactaacgccggtggatccgccagaga 

agtggtatctggtggcgcaccctggtgtaagtattccgactccggtgattmaaagatcctgaactcccgcgcaatac^ 

aaggtcaatagaaacgttgctaaaatgtgaattcagcaatgattgcgaggttatcgcaagaaaacgttttcgcgaggttgatgc 

ggtgcfficctggctgttagaatacgccccgtcgcgcctgactggg 

aagcccgccaggtgctagagcaagccccggaatggctcaatggctttgtggcgaaaggcgctaatctttccccattgcacag 
agccatgctttaa (SEQ ID NO: 19) 

5) yejD 

atgcgacttgataaamatcgracagcaactcggcgttagccgtgctattgccgggcgtgaaatccgcggcaatcgtgtra 

gtcgatggcgaaatcgtccgtaatgcagcgttcaaactgcttcctgaacatgatgtcgcttacgatggcaacccgctggcgra 

caacacggtccacgttacttcatgctcaataagcctc^gggctatgtttgctccacggacgaccctgatcacccaac^ 

attttcttgatgaaccggtagcgtggaaactgcatgcggcggggcggttggatattgataccaccggtctggtgctgatgactga 

tgatggtcagtggtcgcaccgcattactttfcc^ 

gacgatacggcagagcaatttgctaaaggcgtgcagctgcataacgaaaaagatctcactaagcctgcggtgctggaagtg 
attaccccaacgraggttcgtctgaccatcagcgaagggcgttatcatcaggtgaaacgcatgttcgccgccgtgggtaacca 
cgtggttgagctgcatcgtgaacgtattggcggtattacgctggatgctgatttagcccccggtgaatatcgtccgttaactgaag 
aagaaattgccagcgtcgtctaa (SEQ ID NO: 20) 
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6) yrfl 

atgattatgccgcaacatgacc^attac^tcgctatctgtttgaaaactttgccgtgcgcggcgaactggtaaccgtttcgga^ 

ccctgcaacagatccttgagaaccacgattatccgcagcccgttaaaaacgtgctggcagaactgctggttgcgaccag^ 

gttaaccgctacgctgaagtttgatggtgatatcaccgtacagctgcagggcgacggtccgatgaatctggcggttattaacggt 

aacaataaccagcagatgcgcggtgtggcgcgcgtgcagggcgaaattccagaaaatgccgacctgaaaacgctggtcg 

gcaatggttacgtggtgatcaccattaccccgagcgaaggcgaacgctatcagggcgtagttggtctggaaggtgataccctg 

gcggcctgcctggaagattactttatgcgttctgaacagctgccgacgcgcctgtttattcgcaccggcgacgtagacggcaaa 

ccggctgcaggcggtatgttgttgcaggtaatgcctgcgcaaaatgcccagcaggacgactttgaccacctggcgacgctaa 

ccgaaaccatcaaaaccgaagaactgctgaccttaccggcaaacgaagtgttgtggcgtttgtatcacgaagaagaggtga 

cggtttacgatccgcaggatgtggagttcaaatgcacctgctcgcgtgaacgttgcgccgatgcgctgaaaacgctgcctgatg 

aagaagttgatagcatcctggcggaagatggcgaaattgacatgcattgtgattactgcggtaaccactatctgttcaatgcgat 

ggatattgctgaaatccgcaacaacgcgtctccggcagatccgcaagttcattaa (SEQ ID NO: 21) 

7) yggJ 

gtggggagacgacgcggatttttaactatgcgtatcccccgcatttatcatcctgaaccactgaccagccattctcacatcgcgct 

ttgcgaagatgccgccaaccatatcgggcgcgtactgcgcatggggccggggcaggcgttgcaattgtttgacggtagcaac 

caggtctttgacgccgaaattaccagcgccagcaaaaaaagcgtggaagtgaaggtgctggaaggccagatcgacgatcg 

cgaatctccgctgcatattcacctcggtcaggtgatgtcgcgtggtgaaaaaatggaatttactatccagaaatcgatcgaactc 

ggtgtaagcctcattacgccacttttttctgagcgctgcggcgttaaactggatagtgaacgtctgaacaagaagcttcagcagt 

ggcagaagattgcaattgctgcctgtgagcagtgtggtcgtaaccgggtgccggaaatccgtccagcgatggatctggaagc 

ctggtgtgcagagcaggatgaaggactgaaactgaatcttcacccgcgcgccagtaacagcatcaatacgttgccgttaccg 

gttgaacgcgtccgcctgctgattggcccggaaggcggtttatcggc^^ 

atatcctgttgggacctcgcgttttgcgtacagagacaactgcgrt^ 

ggctaa (SEQ ID NO: 22) 

8) yjeE 

atgatgaatcgagtaattccgctccctgatgagcaggcaacattagacctgggcgagcgggtagcgaaagcctgcgatggc 

gcaaccgtaatctatctgtatggcgaffiaggcgcaggtaaaaccacctttag 

gtaatgtcaaaagccccacttatacgctggtcgaaccctatacgctcgara^ 

ccgatcccgaggagctggagtttatggggatccgcgattattttgccaacgatgccatctgcctggtggagtggccacaacaa 
ggtacaggtgttcttcctgacccggatgtcgaaatacacattgattatcaggcacaaggccgtgaggcgcgcgtgagtgcggtt 
tcctctgcgggtgaattgttgctggcgcgtttagccggttaa 
(SEQ ID NO: 23) 

9) yiaO 

atgaaattacgctctgtaacctacgcattattcattgctggcctggctgcattcagcaMtcttctctggcggcaca 
cggttatgaaacatcacaaaccgactcgcaacatattgcggcgaaaaaattcaatgatttattgcaggagagaaccaaaggc 
gagctgaaattaaaactgttcccggacagcactctcggtaacgcgcaggcgatgatcagcggcgtacgtggcggcaccatc 
gatatggaaatgtccggctcgaataactttgccgggttatracxagtgatga 

ctcacgcgcataaaacgctcgacggcaaagtcggtgatgatctgaaagcctcacttgaaggtaaaggactgaaagtactgg 

cctactgggaaaacggctggcgcgatgtcaccaactcgcgcgcaccggttaaaacccccgccgacctgaaagggctgaaa 

atccgcaccaacaatagcccgatgaatatcgccgcattcaaagtctttggcgctaacccgatcccgatgccgtttgccgaagtc 

tataccgggctggaaacccgcactatcgacgctcaggaacacccgatcaacgtcgtctggtcagcaaaatttttcgaagtgca 

gaagttcctttctctgacgcaccacgcctattccccgcttctggtggtgatcaacaaagcgaagmgatggcttaagtccggag 

ccagcaggcgctagtttcatctgcacaagaagcgggtaactatcagcgcaaactggttgctgaagatcagcaaaaaatcatc 

gacggcatgaaagaagcgggcgtggaagtcatcaccgatctcgaccgcaaagcctttagcgacgcactggggaatcaggt 

tcgcgacatgtttgttaaagatgtgccgcagggagctgatctgctgaaagccgtggatgaggtgcaataa (SEQ ID NO: 

24) 

10) yrdC 

gtgaataataacctgcaaagagacgctatcgcagctgcgatagatgttctcaatgaagaacgtgtcatcgcctatccaacgga 
agccgttttcggtgttgggtgcgatcctgatagcgaaacagcagtgatgcgactgttggagttaaaacagcgtccggttgataa 
ggggctgattttaatcgcagcaaattacgagcagcttaaaccctatattgatgacaccatgttgactgacgtgcagcgtgaaac 
cattttttcccgctggcc^ggtcctgtcacctttgtctttcccgcgcctgcgacaac^ccgcgctggttgacg 
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cttgctgtacgagtcaccgacratccgttggtggtt^ 

gtggattgccaccttgtcgaacagtagacgaagttcgcgcacaatttggcgcggcgttcccggttgtgcctggtgaaacgggg 
gggcgtttaaatccttcagaaatccgcgatgccctgacgggtgaactgtttcgacaggggtaa (SEQ ID NO: 25) 

11) yhbC 

gtgggcttgtccacattagagcaaaaattaacagagatgattactgcgccagttgaggccctgggttttgaactggttggcatcg 

aatttattcgcggtcgcacatccacactgcgcatctatattgatagtgaagatggcatcaatgttgatgattgtgctgatgtgagcc 

accaggtaagtgctgtgctggatgttgaagatcccatcaccgttgcttataacctggaagtctcctcaccgggtctcgatcgccc 

actgttcacggctgaacactacgcccgttttgtcggagaagaggtgactctggttctccgtatggcggtacaaaaccgtcgtaa 

atggcagggcgttatcaaagcggtagacggtgaaatgatcacagttaccgtcgaaggtaaagatgaagtgttcgcgctgagt 

aatatccagaaggcgaacctggttccccacttttaa (SEQ ID NO: 26) 

12) ygbP 

atggcaaccactcatttggatgtttgcgccgtggttccggcggccggatttggccgtcgaatgcaaacggaatgtcctaagcaa 

tatctctcaatcggtaatcaaaccattcttgaacactcggtgcatgcgctgctggcgcatccccgggtgaaacgtgtcgtcattg^ 

cataagtcctggcgatagccgttttgcacaacttcctctggcgaatcatccgcaaatcaccgttgtagatggcggtgatgagcgt 

gccgattccgtgctggcaggtctgaaagccgctggcgacgcgcagtgggtattggtgcatgacgccgctcgtccttgtttgcato 

aggatgacctcgcgcgattgttggcgttgagcgaaaccagccgcacgggggggatcctcgccgcaccagtgcgcgatacta 

tgaaacgtgccgaaccgggcaaaaatgccattgctcataccgttgatcgcaacggcttatggcacgcgctgacgccgcaattt 

ttccctcgtgagctgttacatgactgtctgacgcgcgctctaaatgaaggcgcgactattaccgacgaagcctcggcgctggaa 

tattgcggattccatcctcagttggtcgaaggccgtgcggataacattaaagtcacgcgcccggaagatttggcactggccga 

gttttacctcacccgaaccatccatcaggagaatacataa (SEQ ID NO: 27) 

13) ybeY 

atgagtcaggtgatcctcgatttacaactggcatgtgaagataattccgggttaccggaagagagccagtttcagacatggctg 
aatgcggtgatcccgcagtttcaggaagaatcggaagtgacgattcgcgtggtcgataccgccgaaagccacagtctgaatc 
tgacctatcgcggtaaggataagccgaccaacgtgctctccttcccgtttgaagtgccgcctggcatggaaatgtcgctactg^ 
gcgatctggttatctgccgtcaggtggttgagaaggaagctcaggagcaaggcaaaccactggaggcgcactgggcgcata 
tggtggtgcacggcagtctgcatttgttaggttacgatcacatcgaagatgacgaagcagaagaaatggaagccctcgaaac 
agagattatgcttgctctgggctatgaggatccgtacattgccgagaaagaataa (SEQ ID NO: 28) 

14) gcpE 

atgcataaccaggctccaattcaacgtagaaaatcaacacgtatttacgttgggaatgtgccgattggcgatggtgctcccatc 

gccgtacagtccatgaccaatacgcgtacgacagacgtcgaagcaacggtcaatcaaatcaaggcgctggaacgcgttgg 

cgctgatatcgtccgtgtatccgtaccgacgatggacgcggcagaagcgttcaaactcatcaaacagcaggttaacgtgccg 

ctggtggctgacatccacttcgactatcgcattgcgctgaaagtagcggaatacggcgtcgattgtctgcgtattaa 

atatcggtaatgaagagcgtattcgcatggtggttgactgtgcgcgcgataaaaacattccgatccgtattggcgttaacgccgg 

atcgctggaaaaagatctgcaagaaaagtatggcgaaccgacgccgcaggcgttgctggaatctgccatgcgtcatgttgat 

c^tctcgatcgcctgaacttcgatcagttcaaagtcagcgtgaaagcgtctgacgtcttcctcgctgtt^ 

caaaacagatcgatcagccgttgcatctggggatcaccgaagccggtggtgcgcgcagcggggcagtaaaatccgccattg 

gtttaggtctgctgctgtctgaaggcatcggcgacacgctgcgcgtatcgctggcggccgatccggtcgaagagatcaaagtc 

ggtttc^ataWtgaaatcgctgcgtatccgttcgcgagggatcaacttcatcgcctgcccgacctgttcgcgtcaggaam 

ttatcggtacggttaacgcgctggagcaacgcctggaagatatcatcactccgatggacgtttcgattatcggctgcgtggtgaa 

tggcccaggtgaggcgctggtttctacactcggcgtcaccggcggcaacaagaaaagcggcctctatgaagatggcgtgcg 

caaagaccgtctggacaacaacgatatgatcgaccagctggaagcacgcattcgtgcgaaagccagtcagctggacgaa 

gcgcgtcgaattgacgttcagcaggttgaaaaataa 

(SEQ ID NO: 29) 

15) kdtB 

atgcaaaaacgggcgatttatccgggtactttcgatcccattaccaatggtcatatcgatatcgtgacgcgcgccacgcagatgt 
tcgatcacgttattctggcgattgccgccagccccagtaaaaaaccgatgtttaccctggaagagcgtgtggcactggcacag 
caggcaaccgcgcatctggggaacgtggaagtggtcgggtttagtgatttaatggcgaacttcgcccgtaatcaacacgctac 
ggtgctgattcgtggcctgcgtgcggtggcagattttgaatatgaaatgcagctggcgcatatgaatcgccacttaatgccggaa 
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ctggaaagtgtgtttctgatgccgtcgaaagagtggtcgtttatctcttcatcgttggtgaaagaggtggcgcgcratcaggg 
atgtcacccatttcctgccggagaatgtccatcaggcgctgatggcgaagttagcgtag (SEQ ID NO: 30) 

16) pfs 

atgaaaatcggcatcattggtgcaatggaagaagaagttacgctgctgcgtgacaaaatcgaaaaccgtcaaactatcagtc 
tcggcggttgcgaaatctataccggccaactgaatggaaccgaggttgcgcttctgaaatcgggcatcggtaaagtcgctgcg 
gcgctgggtgcractttgctgttggaacactgraagccagatgtgattatte^ 

tgaaagtgggcgatatcgttgtctcggacgaagcacgttatcacgacgcggatgtcacggcatttggttatgaatacggtcagtt 

accaggctgtccggcaggctttaaagctgacgataaactgatcgctgccgctgaggcctgcattgccgaactgaatcttaacg 

ctgtacgtggcctgattgttagcggcgacgctttcatcaacggttctgttggtctggcgaaaatccgccacaacttcccacaggc 

cattgctgtagagatggaagcgacggcaatcgcccatgtctgccacaatttcaacgtcccgtttgttgtcgtacgcgccatctcc 

gacgtggccgatcaacagtctcatcttagcttcgatgagttcctggctgttgccgctaaacagtccagcctgatggttgagtcact 

ggtgcagaaacttgcacatggctaa (SEQ ID NO: 31) 

17) ycaJ 

gtgagcaatctgtcgctcgatttttcggataatacttttcaacctctggccgcgcgtatgcggccagaaaatttagcacagtatatc 

ggccagcaacatttgctggctgcggggaagccgttgccgcgcgctatcgaagccgggcatttacattctatgatcctctggggg 

ccgccgggtaccggcaaaacaactctcgctgaagtgattgcccgctatgcgaacgctgatgtggaacgtatttctgccgtcac 

ctctggcgtgaaagagattcgcgaggcgatcgagcgcgcccggcaaaaccgcaatgcaggtcgccgcactattctttttgttg 

acgaagttcaccgtttcaacaaaagccagcaggatgcatttctgccacatattgaagacggcaccatcacttttattggcgcaa 

ccactgaaaacccgtcgtttgagcttaattcggcactgctttcccgtgcccgtgtctatctgttgaaatccctgagtacag 

ttgagcaagtactaactcaggcgatggaagacaaaacccgtggctatggtggtcaggatattgttctgccagatgaaacacg 

acgcgccattgctgaactggtgaatggcgacgcgcgccgggcgttaaatacgctggaaatgatggcggatatggccgaagt 

cgatgatagcggtaagcgggtcctgaagcctgaattactgaccgaaatcgccggtgaacgtagcgcccgctttgataacaaa 

ggcgatcgcmtacgatctgamccgcactgcataagtcggtacgtggtagcgcacccgatgcggcgctgtactggtat^ 

gaattattaccgctggtggcgatccgttatatgtcgcgcgtcgctgtctggcgattgcgtctgaagacgtcggtaatgccgatcca 

cgggcgatgcaggtggcaattgcggcctgggattgctttactcgcgttggcccggcggaaggtgaacgcgccattgctcaggc 

gattgtttacctggcctgcgcgccaaaaagcaacgctgtctacactgcgtttaaagccgcgctggccgatgctcgcgaacgcc 

cggattatgacgtgccggttcatttgcgtaatgcgccgacgaaattaatgaaggaaatgggctacgggcaggaatatcgttac 

gctratgatgaagcaaacgcttatgctgccggtgaggmacttcccgccggaaatagcawaacacgctattatttcccgaca 

aacaggggccttgaaggcaagattggcgaaaagctcgcctggctggctgaacaggatcaaaatagccccataaaacgcta 

ccgttaa (SEQ ID NO: 32) 

18) b1808 

gtgacggacgattttgcaccagacggtcagctggcgaaagcgataccaggctttaagccgcgagaaccacagcgacagat 

ggcggtagccgtcacccaggcgatagaaaaaggccagccgctggtggtggaagcaggaaccggtacgggcaaaaccta 

cgcttacctggctcctgcgctgcgggcgaaaaagaaagtcattatctcgaccggctcaaaagcgttgcaggatc^gctctaca 

gccgcgatttgccaacagtctcaaaggcattgaaatatacgggcaacgtggcgctgctgaaagggcgctcaaactacctctg 

cctcgaacgtctcgaacagcaggcgctggcggggggcgatctgccggtac^aatcttaagcgatgtgatcctgctgcgctcct 

ggtctaatcaaacagtcgatggtgatatcagcacctgcgtcagcgtggcggaagattcacaggcgtggccgctggtcaccag 

caccaacgacaactgtcttggcagcgactgcccgatgtataaagattgctttgtggtcaaagcacgtaaaaaagcgatggac 

gccgatgtggtggtggtaaaccatcatctctttctggcggatatggtggttaaagagagtggatttggcgaactgatcccggaag 

cggacgtcatgatcttcgacgaagcccaccagctaccggac^ttgccagccagtattttggtcagtcactctccagtcgaca 

tgctcgacctggcaaaagacatcaccatcgcctaccgcaccgaattaaaagacacccagcagttacaaaagtgcgctgatc 

gtcttgccragagtgcgcaggattttcgtctgcaactcggtgagccaggttatcgcggtaacctgcgtgagctgttagctaatcc^ 

caaattcagcgggcatttttactgctcgatgacaccctggaactttgttatgacgtggcgaaactgtcactggggcgttccgccttg 

ctggatgcggcatttgagcgcgccacgttgtatcgcacacggctgaagcggctaaaagagatcaatcagccgggctacagct 

actggtacgaatgracttcgcgcrattttactctggctct^ 

aaaacccggtagctggatcttcacctcagc^acgctgtcggtgaac^acgatctgcatcatttcacctcgcggctt^ 

acaggccgagtcgttgctgttgcccagccraffigattacagccgcra^ 

aaccagccaggttctgctcgcragttagcggcaatgctgcgacc^^ 

ctcgcacgccatgatgcgcgatctggccgagcagttccgcgctaccatgacgcttcctgtattgttgcagggggaaacca 

aagggcaactgttgcagraatttgtcagcgccggtaatgcgcttcttgtggc^accagcagtttctgggaaggggtggacgtgc 

gtggcgatacattgtcattggtaattatcgacaaattgccgtttacctcgccggatgatccactgttaaaagcgcgcatggaagat 
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tgtcgtttgcgcggtggcgacccgttcgatgaagtgcaactaccagatgccgtcattactctcaaacagggggtagggcgact 
gattcgcgacgcc^acgatcgtggcgtgctggtgatttgtgacaatcggctggtgatgcgtccttacggcgcgacgtttc^ 
agtctgccgcccgcgccacgcacccgtgacattgcccgtgcggttcgtttccttgcgataccatcctccaggtaa (SEQ ID 
NO: 33) 

19) yeaA 

atggctaataaaccttcggcagaagaactgaaaaaaaatttgtccgagatgcagttttacgtgacgcagaatcatgggacag 
aaccgccamacgggtcgtttactgcataacaagcgtgacggcgtatatcactgtttgatctgcgatgccccgctgtttrattcc^ 
aaaccaagtatgattccggctgtggctggcccagtttctacgaaccggtaagtgaagaatccattcgttatatcaaagacttgtc 
acatggaatgcagcgcatagaaattcgttgcggtaactgtgatgcccatctggggcatgtcttccccgacgggccgcagccaa 
cgggcgaacgttattgtgttaactctgcctctttacgctttaccgatggcgaaaacggcgaagaaatcaacggttga (SEQ 
ID NO: 34) 

20) yagF 

atgaccattgagaaaattttcaccccgcaggacgacgcgttttatgcggtgatcacccacgcggcggggccgcagggcgctc 

tgccgctgaccccgcagatgctgatggaatctcccagcggcaacctgttcggcatgacgcagaacgccgggatgggctggg 

acgccaacaagctcaccggcaaagaggtgctgattatcggcactcagggcggcatccgcgccggagacggacgcccaat 

cgcgctgggctaccacaccgggcattgggagatcggcatgcagatgcaggcggcggcgaaggagatcacccgcaatggc 

gggatcccgttcgcggccttcgtcagcgatccgtgcgacgggcgctcgcagggcacgcacggtatgttcgattccctgccgta 

ccgcaacgacgcggcgatcgtgtttcgccgcctgatccgctccctgccgacgcggcgggcggtgatcggcgtagcgacctg 

cgataaagggctgcccgccaccatgattgcgctggccgcgatgcacgacctgccgactattctggtgccgggcggggcgac 

gctgccgccgaccgtcggggaagacgcgggcaaggtgcagaccatcggcgcgcgtttcgccaaccacgaactctccctgc 

aggaggccgccgaactgggctgtcgcgcctgcgcctcgccgggcg^ 

caggtggtcgcggaggcgctgggtctggcgctgccgcactccgcgctggcgccgtccgggcaggcggtgtggctggagatc 

gcccgccagtcggcgcgcgcggtcagcgagctggatagccgcggcatcacc^cgcgggatatcctctccgataaagcca 

cgaaaacgcgatggtgatccacgcggcgttcggcggctccaccaatttactgctgcacattccggccatcgcccacgcggcg 

ggctgcacgatcccggacgttgagcactggacgcgcatcaaccgtaaagtgccgcgtctggtgagcgtgctgcccaacggc 

ccggactatcacccgaccgtgcgcgccttcctcgcgggcggcgtgccggaggtgatgctccacctgcgcgacctcggcctgc 

tgcatctggacgccatgaccgtgaccggccagacggtgggcgagaaccttgaatggtggcaggcgtccgagcgccgggcg 

cgcttccgccagtgcctgcgcgagcaggacggcgtagagccggatgacgtgatcctgccgccggagaaggcaaaagcga . 

aagggctgacctcgacggtctgcttcccgacgggcaacatcgctccggaaggttcggtgatcaaggccacggcgatcgacc 

cgtcggtggtgggcgaagatggcgtataccaccacaccggccgggtgcgggtgtttgtctcggaagcgcaggcgatcaagg 

cgatcaagcgggaagagattgtgcagggcgatatcatggtggtgatcggcggcgggccgtccggcaccggcatggaagag 

acctaccagctcacctccgcgctaaagcatatctcgtggggcaagacggtgtcgctcatcaccgatgcgcgcttctcgggcgt 

gtcgacgggcgcctgcttcggccacgtgtcgccggaggcgctggcgggcgggccgattggcaagctgcgcgataacgaca 

tcatcgagattgccgtggatcgtctgacgttaactggcagcgtgaacttcatcggcaccgcggacaacccgctgacgccgga 

agagggcgcgcgcgagctggcgcggcggcagacgcacccggacctgcacgcccacgactttttgccggacgacacccg 

gctgtgggcggcactgcagtcggtgagcggcggcacctggaaaggctgtatttatgacaccgataaaattatcgaggtaatta 

acgccggtaaaaaagcgctcggaatttaa (SEQ ID NO: 35) 

21) b1983 

gtgggacgtaaatgggccaatattgttgctaaaaaaacggctaaagacggtgcaacgtctaaaatttatgcaaaattcggtgta 

gaaatctatgctgctgctaaacaaggtgaacccgatccagaattaaacacatctttaaaattcgttattgaacgtgcaaagcag 

gcacaagttccaaagcacgttattgataaagcaattgataaagccaaaggcggcggagatgaaacgttcgtgcagggacgt 

tatgaaggctttggtcctaatggctcaatgattatcgccgagacattgacttcaaatgttaaccgtacgattgctaacgttcgcara 

attttraataaaaaaggcggraatatcggagcggcaggttctgtcagctatatgtttga^atacgggtgtgattgtatttaaagg 

gacagaccctgaccatatttttgaaattttacttgaagctgaagttgatgttcgtgatgtgactgaagaagaaggtaacattgttatt 

tatactgaacctactgaccttcataaaggaatcgcggctctaaaagcagctggaatcactgagttctcaacaacagaattaga 

aatgattgctcaatctgaagttgagctttccccagaagatttagaaatctttgaagggcttgttgatgcccttgaagatgacgacg 

atgtacaaaaagtttatcataacgtcgcaaatctctaa (SEQ ID NO: 36) 

22) yidD 

atggcgccgccactgtcgcctggctcgcgggtcctgatagccctcattcgggtctatcaacgcctgattagtccgctactcgggc 
cgcattgtcgtttcactccaacctgttcaagctacggaattgaggcattgcgcaggtttggagtgataaaaggcagttggttgacg 

Fig. 1 continued 
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gtgaaacgc^tattaaaatgccacccmacaccctggtggtgacgatcccgtcccgcccggaccatttgataccagagaaca 
ctaa (SEQ ID NO: 37) 



23) yceG 

atgaaaaaagtgttattgataatcttgttattgctggtggtactgggtatcgccgctggtgtgggcgtctggaaggttcgccatcttg 

ccgacagcaaattgcttatcaaagaagagacgatatttaccctgaagccagggaccggacgtctggcgctcggtgaacagc 

tttatgccgataagatcatcaatcgtccacgggtttttcaatggctgctgcgtatcgaaccggatcmctcactttaaagccgggac 

ttaccgctttacaccgcagatgaccgtgcgcgagatgctgaaattgctggaaagcggtaaagaagcacagttccctctgcgac 

tggtagaagggatgcgtctgagcgattacctcaagcaattgcgtgaggccccgtatatcaagcatacgctgagcgatgataag 

tacgccaccgtagcgcaggcacttgaactggaaaacccggagtggattgaaggttggttctggccagacacctggatgtata 

ccgccaataccaccgatgtcgcgttactcaagcgagcgcacaagaaaatggtgaaagcggtcgatagcgcctgggaagg 

gcgtgcggacggtctgccttataaagataaaaaccagttggtgacgatggcatcaattatcgaaaaagaaaccgccgttgcc 

agtgaacgcgataaggttgcctcagtatttatcaaccgtttacgcattggtatgcgcctgcagaccgacccgaccgtgatttacg 

ggatgggagagcgttataatggcaaactttctcgtgcagacctggaaacgccgacagcgtataacacctataccattaccggt 

ctgccgccaggtgcgatagcgacgccgggggcggattcgctgaaggctgctgcgcatccggcaaaaacgccgtatctctatt 

ttgtggccgatggtaaaggtggtcacacgmaataccaatcttgccagtc^taacaagtctgtgcaggattatctgaaagtgc^ 

aaggaaaaaaatgcgcagtaa 

(SEQ ID NO:38) 

24) yjbC 

atgctgcccgactcatcagtccgtttaaataaatacatcagcgaaagcggaatttgctcacgccgcgaagcggatcgctatatc 

gagcaaggcaatgtgttccttaatggcaagcgagccaccattggcgatcaggtgaaacccggcgacgttgtgaaagtaaac 

ggtcagttgattgaacctcgggaagccgaagatttggtacttatcgccctgaacaagcccgttggtattgtaagcaccaccgaa 

gatggcgagcgcgataacattgtcgatttcgttaaccacagcaaacgcgtgttcccgattggccgcctggataaagactccca 

ggggctgattttcctc^ccaatcacggcgatctggtgaataagatcctgcgtgctggcaatgatratgagaaagagtatctggt^ 

acggtcgataaaccgattaccgaggagtttattcgcggcatgagtgcgggggtgccaatcctcgggacagtgaccaaaaagt 

gcaaagttaaaaaagaagcgccgtttgtcttccgcattaccctggtgcaggggctgaaccgtcagatccggcgcatgtgcga 

gcatttcggctatgaagtgaaaaagctggaacgcacgcgcatcatgaacgttagcttaagcggcattccgctgggggaatgg 

cgcgatttaaccgacgatgagttaatcgacctctttaagctcattgaaaattcctcttccgaggtaaaacctaaagcgaaggcc 

aaaccgaaaacagcgggcatcaaacgtccagtcgttaagatggaaaaaacggcggaaaaaggcggtcgcccggcgtcc 

aacggtaagcgttttacctcgccggggcgtaaaaagaaggggcgctga (SEQ IDNO:39) 
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SEQUENCE LISTING 

<110> GPC Biotech AG 

<120> Novel method for identifying antibacterial compounds 

<130> D 1400 PCT 

<140> 
<141> 

<160> 45 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 1 

atggttataa atttggagtg tgaaggttat tgcgtgtaa 39 



<210> 2 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 2 

gttataaatt tggagtgtga aggttattgc gtg 33 



<210> 3 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 3 

cacgcaataa ccttcacact ccaaatttat aac 33 



<210> 4 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 4 

gatcggatcc aaattccagt tagccatgat gcggtc 36 



<210> 5 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 5 

cacgcaataa ccttcacact ccaaatttat aaccattata cacggacgct atgc 



<210> 6 
<211> 55 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 6 

gttataaatt tggagtgtga aggttattgc gtgacggatt aattttgttt ctctt 



<210> 7 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 7 

gatcgtcgac gcgctcgata tcaccgatga acaaccg 



<210> 8 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 8 

caatccgctg ctttatttct gtcag 
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<210> 9 
<211> 25 
<2I2> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 9 

ttataacgaa atcaacggga aacct 25 



<210> 10 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 10 

gatcggatcc ctctaaaaaa taaggaatta aagg 34 



<210> 11 
<211> 56 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 11 

cacgcaataa ccttcacact ccaaatttat aaccatagga tacctaatta attaac 56 



<210> 12 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 12 

gttataaatt tggagtgtga aggttattgc gtgaagagcg ccatttccca ccgt 54 



<210> 13 
<211> 36 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Description cf Artificial Sequence: artificial 
sequence 

<400> 13 

gatcgtcgac tcataztgct gataacccgc tgcggt 36 



<210> 14 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description cf Artificial Sequence: artificial 
sequence 

<400> 14 

gttgacggcc aggccaacag teat 



<210> 15 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 15 

ataaccctgg gcaatcgcct eg 



<210> 16 
<211> 480 
<212> DNA 

<213> Escherichia coli 



<400> 16 

atgcgaattg gacacggttt tgaegtacat gcctttggcg gtgaaggece aattatcatt 60 
ggtggcgtac gcattcctta cgaaaaagga ttgctggcgc attctgatgg cgacgtggcg 120 
ctccatgcgt tgaccgatgc attgettgge gcggcggcgc tgggggatat eggcaagctg 180 
ttcceggata ccgatccggc atttaaaggt gccgatagcc gegagctget acgcgaagcc 240 
tggcgtcgta ttcaggcgaa gggttatacc cttggcaacg tcgatgtcac tatcatcget 300 
caggcaccga agatgttgcc gcacattcca caaatgegeg tgtttattgc cgaagatctc 360 
ggctgecata tggatgatgt taacgtgaaa gccactacta eggaaaaact gggatttacc 420 
ggacgtgggg aagggattgc ctgtgaagcg gtggcgctac tcattaaggc aacaaaatga 4 80 



<210> 17 
<211> 537 
<212> DNA 

<213> Escherichia coli 



<400> 17 

atgcgccgcg cttttataac eggagtttte 
tactggatgc gtcacgcgct gacgctggcg 
gteggegegg tattagtgea taacaategg 
ggtcgccatg atcccaccgc acatgeagaa 



tttttgtctg aagtcgaatt tagecacgaa 60 

aaacgtgcct gggatgagcg ggaagtgccg 120 

gtaatcggcg aaggctggaa ccgcccgatt 180 

atcatggccc tgcggcaggg tggtccggtg 240 
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atgcaaaatt atcgtctgat cgacgccacg ttgtatgtca cgcttgaacc atgtgtaatg 300 

tgtgccggag cgatgatcca cagtcgcatt ggtcgcgtgg tctttggtgc gcgcgacgcg 360 

tttaatggat gtgctgcatc atccgggtat gaatcaccga 420 

actggcggat gagtgcgcgg cgttgctcag tgacttcttt 



aaaactggcg ctgcgggatc 
gtggaaatta cggaaggaat 



cgcatgcgcc gccaggaaat taaagcgcag aaaaaagcgc aatcctcgac ggattaa 



480 
537 



<210> 18 
<211> 621 
<212> DNA 

<213> Escherichia coli 
<400> 18 

atgaggtata tagttgcctt aacgggaggc attggcagtg gcaagagtac cgttgccaat 60 
gcgtttgctg atctcggaat taacgtcatt gatgccgata ttattgcgcg tcaggtggtt 120 
gaaccaggtg cacctgcgct acatgccatt gctgatcact ttggcgctaa catgattgct 180 
gctgatggaa cattgcagcg ccgggccttg cgcgagcgga tcttcgccaa cccggaagag 240 
aaaaactggc ttaacgccct gctgcatccg ctgattcagc aagagacgca acaccagatc 300 
cagcaagcta cttcccccta tgtactgtgg gttgtgccat tgctggtaga aaactcactg 360 
tataaaaaag cgaatcgagt gcttgtggtg gatgtcagcc cagaaacgca acttaagcgc 420 
accatgcagc gcgatgatgt aactcgcgag catgrcgaac aaatccttgc tgctcaggca 4 80 
acgcgcgaag cccgccttgc cgtggcagat gacgtcattg ataataacgg cgcaccggat 540 
gctatcgcat cggatgttgc ccgcctgcac gcacactatt tgcagcttgc gtcgcagttt 600 
gtctcacagg aaaaaccgta a 621 



<210> 19 
<211> 852 
<212> DNA 

<213> Escherichia coli 



<400> 19 

atgcggacac 

cgtgcggatg 

atcagcattg 

gaacatgaag 

agcgggcgtc 

ggcggcggtc 

ctctggcaat 

gatgttcctg 

acgccggtgg 

actccggtga 

acgttgctaa 

gaggttgatg 

ggggcctgtg 

gccccggaat 

gccatgcttt 



agtggccctc 
gttaccacac 
agcttcgtga 
ataacctgat 
ttccgacggg 
tcggcggtgg 
gcgggctaag 
tctttgttcg 
atccgccaga 
tttttaaaga 
aatgtgaatt 
cggtgctttc 
tctttgctga 
ggctcaatgg 
aa 



tccggcaaaa 
gctgcaaacg 
cgatggggat 
cgttcgcgca 
aagcggtgcg 
ttcatccaat 
catggatgag 
ggggcatgcc 
gaagtggtat 
tcctgaactc 
cagcaatgat 
ctggctgtta 
atttgataca 
ctttgtggcg 



cttaatctgt 
ctgtttcagt 
attcgtctgt 
gcgcgattgt 
aatatcagca 
gccgcgacgg 
ctggcggaaa 
gcgtttgccg 
ctggtggcgc 
ccgcgcaata 
tgcgaggtta 
gaatacgccc 
gagtctgaag 
aaaggcgcta 



ttttatacat 
ttcttgatta 
taacgcccgt 
tgatgaaaac 
.ttgacaagcg 
tcctggtggc 
tggggctgac 
aaggcgttgg 
accctggtgt 
cgccaaaaag 
tcgcaagaaa 
cgtcgcgcct 
cccgccaggt 
atctttcccc 



taccggtcag 60 
cggcgacacc 120 
tgaaggcgtg 180 
tgcggcagac 240 
tttgccgatg 300 
attaaatcat 360 
gctgggcgca 420 
tgaaatacta 480 
aagtattccg 540 
gtcaatagaa 600 
acgttttcgc 660 
gactgggaca 720 
gctagagcaa 780 
attgcacaga 840 
852 



<210> 20 
<211> 696 
<212> DNA 

<213> Escherichia coli 
<400> 20 

atgcgacttg ataaatttat cgcacagcaa ctcggcgtta gccgtgctat tgccgggcgt 60 
gaaatccgcg gcaatcgtgt caccgtcgat ggcgaaatcg tccgtaatgc agcgttcaaa 120 
ctgcttcctg aacatgatgt cgcttacgat ggcaacccgc tggcgcagca acacggtcca 180 
cgttacttca tgctcaataa gcctcagggc tatgtttgct ccacggacga ccctgatcac 240 
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ccaacggtgc 
ttggatattg 
attacttctc 
gctgacgata 
actaagcctg 
gggcgttatc 
catcgtgaac 
ccgttaactg 



tctartttct 
ataccaccgg 
cgcgccatca 
cggcagagca 
cggtgctgga 
atcaggtgaa 
gtattggcgg 
aagaagaaat 



tgatgaaccg 
tctggtgctg 
ttgcgagaag 
atttgctaaa 
agtgattacc 
acgcatgttc 
tattacgctg 
tgccagcgtc 



gtagcgtgga 
atgactgatg 
acctatctgg 
ggcgtgcagc 
ccaacgcagg 
gccgccgtgg 
gatgctgatt 
gtctaa 



aactgcatgc 
atggtcagtg 
tgacactgga 
tgcataacga 
ttcgtctgac 
gtaaccacgt 
tagcccccgg 



ggcggggcgg 300 
gtcgcaccgc 360 
atcacctgta 420 
aaaagatctc 480 
catcagcgaa 540 
ggtrgagctg 600 
tgaatatcgt 660 
696 



<210> 21 
<211> 885 
<212> DNA 

<213> Escherichia coli 



<400> 21 

atgattatgc 

ggcgaactgg 

cagcccgtta 

ctgaagtttg 

gttattaacg 

ccagaaaatg 

ccgagcgaag 

tgcctggaag 

ggcgacgtag 

aatgcccagc 

gaactgctga 

acggtttacg 

gatgcgctga 

attgacatgc 

gaaatccgca 



cgcaacatga 
taaccgtttc 
aaaacgtgct 
atggtgatat 
gtaacaataa 
ccgacctgaa 
gcgaacgcta 
attactttat 
acggcaaacc 
aggacgactt 
ccttaccggc 
atccgcagga 
aaacgctgcc 
attgtgatta 
acaacgcgtc 



ccaattacat 
ggaaaccctg 
ggcagaactg 
caccgtacag 
ccagcagatg 
aacgctggtc 
tcagggcgta 
gcgttctgaa 
ggctgcaggc 
tgaccacctg 
aaacgaagtg 
tgtggagttc 
tgatgaagaa 
ctgcggtaac 
tccggcagat 



cgctatctgt 
caacagatcc 
ctggttgcga 
ctgcagggcg 
cgcggtgtgg 
ggcaatggtt 
gttggtctgg 
cagctgccga 
ggtatgttgt 
gcgacgctaa 
ttgtggcgtt 
aaatgcacct 
gttgatagca 
cactatctgt 
ccgcaagttc 



ttgaaaactt 
ttgagaacca 
ccagcctgtt 
acggtccgat 
cgcgcgtgca 
acgtggtgat 
aaggtgatac 
cgcgcctgtt 
tgcaggtaat 
ccgaaaccat 
tgtatcacga 
gctcgcgtga 
tcctggcgga 
tcaatgcgat 
attaa 



tgccgtgcgc 60 
cgattatccg 120 
aaccgctacg 180 
gaatctggcg 240 
gggcgaaatt 300 
caccattacc 360 
cctggcggcc 420 
tattcgcacc 480 
gcctgcgcaa 54 0 
caaaaccgaa 600 
agaagaggtg 660 
acgttgcgcc 720 
agatggcgaa 780 
ggatattgct 840 
885 



<210> 22 
<211> 759 
<212> DNA 

<213> Escherichia coli 



<400> 22 

gtggggagac gacgcggatt tttaactatg cgtatccccc gcatttatca tcctgaacca 60 

ctgaccagcc attctcacat cgcgctttgc gaagatgccg ccaaccatat cgggcgcgta 120 

ctgcgcatgg ggccggggca ggcgttgcaa ttgtttgacg gtagcaacca ggtctttgac 180 

gccgaaatta ccagcgccag caaaaaaagc gtggaagtga aggtgctgga aggccagatc 24 0 

gacgatcgcg aatctccgct gcatattcac ctcggtcagg tgatgtcgcg tggtgaaaaa 300 

atggaattta ctatccagaa atcgatcgaa ctcggtgtaa gcctcattac gccacttttt 360 

tctgagcgct gcggcgttaa actggatagt gaacgtctga acaagaagct tcagcagtgg 420 

cagaagattg caattgctgc ctgtgagcag tgtggtcgta accgggtgcc ggaaatccgt 4 80 

ccagcgatgg atctggaagc ctggtgtgca gagcaggatg aaggactgaa actgaatctt 54 0 

cacccgcgcg ccagtaacag catcaatacg ttgccgttac cggttgaacg cgtccgcctg 600 

ctgattggcc cggaaggcgg tttatcggca gatgaaattg ccatgactgc ccgctatcaa 660 

tttactgata tcctgttggg acctcgcgtt ttgcgtacag agacaactgc gctcaccgcc 720 
attaccgcgc tacaagtacg atttggcgat ttgggctaa 759 



<210> 23 
<211> 462 
<212> DNA 

<213> Escherichia coli 
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<400> 23 

atgatgaatc 

gtagcgaaag 

aaaaccacct 

cccacttata 

ttgtaccgcc 

aacgatgcca 

gatgtcgaaa 

tcctctgccg 



gagtaattcc 
cctgcgatgg 
ttagccgggg 
cgctggtcga 
ttgccgatcc 
tctgcctggt 
tacacattga 
gtgaattgtt 



gctccctgat 
cgcaaccgta 
ctttttacag 
accctatacg 
cgaggagctg 
ggagtggcca 
ttatcaggca 
gctggcgcgt 



gagcaggcaa 
atctatctgt 
gctctgggtc 
ctcgacaact 
gagtttatgg 
caacaaggta 
caaggccgtg 
ttacccggtt 



cattagacct 
atggcgattt 
atcagggtaa 
taatggtcta 
ggatccgcga 
caggtgttct 
acgcgcgcgt 
aa 



gggcgagcgg 60 
aggcgcaggt 120 
tgtcaaaagc 180 
tcactttgat 240 
ttattttgcc 300 
tcctgacccg 360 
gagtgcggtt 420 
4 62 



<210> 24 
<21I> 987 
<212> DNA 

<213> Escherichia coli 
<400> 24 

atgaaattac gctctgtaac ctacgcatta 
tcttctctgg cggcacaatc tttacgtttc 
catattgcgg cgaaaaaatt caatgattta 
ttaaaactgt tcccggacag cactctcggt 
ggcggcacca tcgatatgga aatgtccggc 
atgaacttgc ttgatgtccc tttcctgttc 
gacggcaaag tcggtgatga tctgaaagcc 
gcctactggg aaaacggctg gcgcgatgtc 
gccgacctga aagggctgaa aatccgcacc 
aaagtctttg gcgctaaccc gatcccgatg 
acccgcacta tcgacgctca ggaacacccg 
gaagtgcaga agttcctttc tctgacgcac 
aacaaagcga agtttgatgg cttaagtccg 
caagaagcgg gtaactatca gcgcaaactg 
ggcatgaaag aagcgggcgt ggaagtcatc 
gcactgggga atcaggttcg cgacatgttt 
ctgaaagccg tggatgaggt gcaataa 



ttcattgctg gcctggctgc attcagcaca 60 
ggttatgaaa catcacaaac cgactcgcaa 120 
ttgcaggaga gaaccaaagg cgagctgaaa 180 
aacgcgcagg cgatgatcag cggcgtacgt 240 
tcgaataact ttgccgggtt atcaccagtg 300 
cgcgataccg ctcacgcgca taaaacgctc 360 
tcacttgaag gtaaaggact gaaagtactg 420 
accaactcgc gcgcaccggt taaaaccccc 480 
aacaatagcc cgatgaatat cgccgcattc 540 
ccgtttgccg aagtctatac cgggctggaa 600 
atcaacgtcg tctggtcagc aaaatttttc 660 
cacgcctatt ccccgcttct ggtggtgatc 720 
gagttccagc aggcgctagt ttcatctgca 780 
gttgctgaag atcagcaaaa aatcatcgac 840 
accgatctcg accgcaaagc ctttagcgac 900 
gttaaagatg tgccgcaggg agctgatctg 960 

987 



<210> 25 
<211> 573 
<212> DNA 

<213> Escherichia coli 
<400> 25 

gtgaataata acctgcaaag agacgctatc 
cgtgtcatcg cctatccaac ggaagccgtt 
acagcagtga tgcgactgtt ggagttaaaa 
atcgcagcaa attacgagca gcttaaaccc 
cagcgtgaaa ccattttttc ccgctggcca 
gcgacaacac cgcgctggtt gacgggccgc 
catccgttgg tggttgcttt gtgccaggct 
aacttgagtg gattgccacc ttgtcgaaca 
gcgttcccgg ttgtgcctgg tgaaacgggg 
gccctgacgg gtgaactgtt tcgacagggg 



gcagctgcga tagatgttct caatgaagaa 60 
ttcggtgttg ggtgcgatcc tgatagcgaa 120 
cagcgtccgg ttgataaggg gctgatttta 180 
tatattgatg acaccatgtt gactgacgtg 240 
ggtcctgtca cctttgtctt tcccgcgcct 300 
tttgattcgc ttgctgtacg agtcaccgac 360 
tatggtaaac cgctggtttc taccagtgcc 420 
gtagacgaag ttcgcgcaca atttggcgcg 480 
gggcgtttaa atccttcaga aatccgcgat 540 
taa 573 



<210> 26 
<211> 459 
<212> DNA 

<213> Escherichia coli 
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8. 

<400> 26 

gtgggcttgt ccacattaga gcaaaaatta acagagatga ttactgcgcc agttgaggcc 60 

ctgggttttg aactggttgg catcgaattt attcgcggtc gcacatccac actgcgcatc 120 

tatattgata gtgaagatgg catcaatgtt gatgattgtg ctgatgtgag ccaccaggta 180 

agtgctgtgc tggatgttga agatcccatc accgttgctt ataacctgga agtctcctca 240 

ccgggtctcg atcgcccact gttcacggct gaacactacg cccgttttgt cggagaagag 300 

gtgactctgg ttctccgtat ggcggtacaa aaccgtcgta aatggcaggg cgttatcaaa 360 

gcggtagacg gtgaaatgat cacagttacc gtcgaaggta aagatgaagt gttcgcgctg 420 

agtaatatcc agaaggcgaa cctggttccc cacttttaa 459 



<210> 27 
<211> 711 
<212> DNA 

<213> Escherichia coli 



<400> 27 

atggcaacca 

atgcaaacgg 

tcggtgcatg 

ggcgatagcc 

ggcggtgatg 

tgggtattgg 

ttggcgttga 

atgaaacgtg 

tggcacgcgc 

gctctaaatg 

catcctcagt 

gcactggccg 



ctcatttgga 
aatgtcctaa 
cgctgctggc 
gttttgcaca 
agcgtgccga 
tgcatgacgc 
gcgaaaccag 
ccgaaccggg 
tgacgccgca 
aaggcgcgac 
tggtcgaagg 
agttttacct 



tgtttgcgcc 
gcaatatctc 
gcatccccgg 
acttcctctg 
ttccgtgctg 
cgctcgtcct 
ccgcacgggg 
caaaaatgcc 
atttttccct 
tattaccgac 
ccgtgcggat 
cacccgaacc 



gtggttccgg 
tcaatcggta 
gtgaaacgtg 
gcgaatcatc 
gcaggtctga 
tgtttgcatc 
gggatcctcg 
attgctcata 
cgtgagctgt 
gaagcctcgg 
aacattaaag 
atccatcagg 



cggccggatt 
atcaaaccat 
tcgtcattgc 
cgcaaatcac 
aagccgctgg 
aggatgacct 
ccgcaccagt 
ccgttgatcg 
tacatgactg 
cgctggaata 
tcacgcgccc 
agaatacata 



tggccgtcga 60 
tcttgaacac 120 
cataagtcct 180 
cgttgtagat 240 
cgacgcgcag 300 
cgcgcgattg 360 
gcgcgatact 420 
caacggctta 480 
tctgacgcgc 540 
ttgcggattc 600 
ggaagatttg 660 
a 711 



<210> 28 
<211> 468 
<212> DNA 

<213> Escherichia coli 
<400> 28 

atgagtcagg tgatcctcga tttacaactg gcatgtgaag ataattccgg gttaccggaa 60 
gagagccagt ttcagacatg gctgaatgcg gtgatcccgc agtttcagga agaatcggaa 120 
gtgacgattc gcgtggtcga taccgccgaa agccacagtc tgaatctgac ctatcgcggt 180 
aaggataagc cgaccaacgt gctctccttc ccgtttgaag tgccgcctgg catggaaatg 240 
tcgctactgg gcgatctggt tatctgccgt caggtggttg agaaggaagc tcaggagcaa 300 
ggcaaaccac tggaggcgca ctgggcgcat atggtggtgc acggcagtct gcatttgtta 360 
ggttacgatc acatcgaaga tgacgaagca gaagaaatgg aagccctcga aacagagatt 420 
atgcttgctc tgggctatga ggatccgtac attgccgaga aagaataa 4 68 



<210> 29 
<211> 1119 
<212> DNA 

<213> Escherichia coli 
<400> 29 

atgcataacc aggctccaat tcaacgtaga aaatcaacac gtatttacgt tgggaatgtg 60 
ccgattggcg atggtgctcc catcgccgta cagtccatga ccaatacgcg tacgacagac 120 
gtcgaagcaa cggtcaatca aatcaaggcg ctggaacgcg ttggcgctga tatcgtccgt 180 
gtatccgtac cgacgatgga cgcggcagaa gcgttcaaac tcatcaaaca gcaggttaac 240 
gtgccgctgg tggctgacat ccacttcgac tatcgcattg cgczgaaagt agcggaatac 300 
ggcgtcgatt gtctgcgtat taaccctggc aatatcggta atgaagagcg tattcgcatg 360 
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gtggttgact 
ctggaasaag 
gccatgcgtc 
aaagcgtctg 
cagccgttgc 
gccattggtt 
gcggccgatc 
cgttcgcgag 
atcggtacgg 
tcgattarcg 
accggcggca 
aacaacgata 
gaagcgcgtc 



gtgcgcgcga 
atctgcaaga 
atgttgatca 
acgtcttcct 
atctggggat 
taggtctgct 
cggtcgaaga 
ggatcaactt 
ttaacgcgct 
gctgcgtggt 
acaagaaaag 
tgatcgacca 
gaattgacgt 



taaaaacatt 
aaagtatggc 
tctcgatcgc 
cgctcttgag 
caccgaagcc 
gctgtctgaa 
gatcaaagtc 
catcgcctgc 
ggagcaacgc 
gaatggccca 
cggcctctat 
gctggaagca 
tcagcaggtt 



ccgatccgta 
gaaccgacgc 
ctgaacttcg 
tcttatcgtt 
ggtggtgcgc 
ggcatcggcg 
ggtttcgata 
ccgacctgtt 
ctggaagata 
ggtgaggcgc 
gaagatggcg 
cgcattcgtg 
gaaaaataa 



ttggcgttaa 
cgcaggcgtt 
atcagttcaa 
tgctggcaaa 
gcagcggggc 
acacgctgcg 
ttttgaaatc 
cgcgtcagga 
tcatcactcc 
tggtttctac 
tgcgcaaaga 
cgaaagccag 



cgccggatcg 
gctggaatct 
agtcagcgtg 
acagatcgat 
agtaaaatcc 
cgtatcgctg 
gctgcgtatc 
atttgatgtt 
gatggacgtt 
actcggcgtc 
ccgtctggac 
tcagctggac 



420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1119 



<210> 30 
<211> 480 
<212> DNA 

<213> Escherichia coli 
<400> 30 

atgcaaaaac gggcgattta tccgggtact ttcgatccca ttaccaatgg tcatatcgat 60 
atcgtgacgc gcgccacgca gatgttcgat cacgttattc tggcgattgc cgccagcccc 120 
agtaaaaaac cgatgtttac cctggaagag cgtgtggcac tggcacagca ggcaaccgcg 180 
catctgggga acgtggaagt ggtcgggttt agtgatttaa tggcgaactt cgcccgtaat 24 0 
caacacgcta cggtgctgat tcgtggcctg cgtgcggtgg cagattttga atatgaaatg 300 
cagctggcgc atatgaatcg ccacttaatg ccggaactgg aaagtgtgtt tctgatgccg 360 
tcgaaagagt ggtcgtttat ctcttcatcg ttggtgaaag aggtggcgcg ccatcagggc 4 20 
gatgtcaccc atttcctgcc ggagaatgtc catcaggcgc tgatggcgaa gttagcgtag 480 



<210> 31 
<211> 699 
<212> DNA 

<213> Escherichia coli 



<400> 31 

atgaaaatcg 

gaaaaccgtc 

accgaggttg 

ttgctgttgg 

gcaccaacgt 

gatgtcacgg 

gctgacgata 

gtacgtggcc 

atccgccaca 

gtctgccaca 

caacagtctc 

atggttgagt 



gcatcattgg 
aaactatcag 
cgcttctgaa 
aacactgcaa 
tgaaagtggg 
catttggtta 
aactgatcgc 
tgattgttag 
acttcccaca 
atttcaacgt 
atcttagctt 
cactggtgca 



tgcaatggaa 
tctcggcggt 
atcgggcatc 
gccagatgtg 
cgatatcgtt 
tgaatacggt 
tgccgctgag 
cggcgacgct 
ggccattgct 
cccgtttgtt 
cgatgagttc 
gaaacttgca 



gaagaagtta 
tgcgaaatct 
ggtaaagtcg 
attattaaca 
gtctcggacg 
cagttaccag 
gcctgcattg 
ttcatcaacg 
gtagagatgg 
gtcgtacgcg 
ctggctgttg 
catggctaa 



cgctgctgcg 
ataccggcca 
ctgcggcgct 
ccggttctgc 
aagcacgtta 
gctgtccggc 
ccgaactgaa 
gttctgttgg 
aagcgacggc 
ccatctccga 
ccgctaaaca 



tgacaaaatc 60 
actgaatgga 120 
gggtgccact 180 
cggtggcctg 240 
tcacgacgcg 300 
aggctttaaa 360 
tcttaacgct 420 
tctggcgaaa 480 
aatcgcccat 540 
cgtggccgat 600 
gtccagcctg 660 
699 



<210> 32 
<211> 1344 
<212> DNA 

<213> Escherichia coli 
<400> 32 

gtgagcaatc tgtcgctcga tttttcggat aatacttttc aacctctggc cgcgcgtatg 60 
cggccagaaa atttagcaca gtatatcggc cagcaacatt tgctggctcc ggggaagccg 120 
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ttgccgcgcg 
accggcaaaa 
atttctgccg 
aaccgcaatg 
agccagcagg 
accactgaaa 
ctgttgaaat 
aaaacccgtg 
gctgaactgg 
atggccgaag 
gccggtgaac 
gcactgcata 
attattaccg 
gaagacgtcg 
tttactcgcg 
gcctgcgcgc 
cgcgaacgcc 
aaggaaatgg 
gccggtgagg 
aggggccttg 
agccccataa 



ctatcgaagc 
caactctcgc 
tcacctctgg 
caggtcgccg 
atgcatttct 
acccgtcgtt 
ccctgagtac 
gctatggtgg 
tgaatggcga 
tcgatgatag 
gtagcgcccg 
agtcggtacg 
ctggtggcga 
gtaatgccga 
ttggcccggc 
caaaaagcaa 
cggattatga 
gctacgggca 
tttacttccc 
aaggcaagat 
aacgctaccg 



cgggcattta 
tgaagtgatt 
cgtgaaagag 
cactattctt 
gccacatatt 
tgagcttaat 
agaggatatt 
tcaggatatt 
cgcgcgccgg 
cggtaagcgg 
ctttgataac 
tggtagcgca 
tccgttatat 
tccacgggcg 
ggaaggtgaa 
cgctgtctac 
cgtgccggtt 
ggaatatcgt 
gccggaaata 
tggcgaaaag 
ttaa 



10 

cattctatga 
gcccgctatg 
attcgcgagg 
tttgttgacg 
gaagacggca 
tcggcactgc 
gagcaagtac 
gttctgccag 
gcgttaaata 
gtcctgaagc 
aaaggcgatc 
cccgatgcgg 
gtcgcgcgtc 
atgcaggtgg 
cgcgccattg 
actgcgttta 
catttgcgta 
tacgctcatg 
gcacaaacac 
ctcgcctggc 



tcctctgggg 
cgaacgctga 
cgatcgagcg 
aagttcaccg 
ccatcacttt 
tttcccgtgc 
taactcaggc 
atgaaacacg 
cgctggaaat 
ctgaattact 
gcttttacga 
cgctgtactg 
gctgtctggc 
caattgcggc 
ctcaggcgat 
aagccgcgct 
atgcgccgac 
atgaagcaaa 
gctattattt 
tggctgaaca 



gccgccgggt 
tgtggaacgt 
cgcccggcaa 
tttcaacaaa 
tattggcgca 
ccgtgtctat 
gatggaagac 
acgcgccatt 
gatggcggat 
gaccgaaatc 
tctgatttcc 
gtatgcgcga 
gattgcgtct 
ctgggattgc 
tgtttacctg 
ggccgatgct 
gaaattaatg 
cgcttatgct 
cccgacaaac 
ggatcaaaat 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1344 



<210> 33 
<211> 1911 
<212> DNA 

<213> Escherichia coli 



<400> 33 

gtgacggacg attttgcacc 
cgagaaccac agcgacagat 
ctggtggtgg aagcaggaac 
cgggcgaaaa agaaagtcat 
agccgcgatt tgccaacagt 
aaagggcgct caaactacct 
gatctgccgg tacaaatctt 
gtcgatggtg atatcagcac 
gtcaccagca ccaacgacaa 
gtggtcaaag cacgtaaaaa 
tttctggcgg atatggtggt 
gtcatgatct tcgacgaagc 
tcactctcca gtcgacaact 
gaattaaaag acacccagca 
gattttcgtc tgcaactcgg 
aatccgcaaa ttcagcgggc 
gtggcgaaac tgtcactggg 
ttgtatcgca cacggctgaa 
tacgaatgca cttcgcgcca 
ttcaaagagt taatggcgca 
gtgaacgacg atctgcatca 
ctgttgccca gcccatttga 
ccgcaaacca accagccagg 
gaagctaaca acggtcgttg 
gccgagcagt tccgcgctac 
gggcaactgt tgcagcaatt 
ttctgggaag gggtggacgt 
ccgtttacct cgccggatga 
ggtggcgacc cgttcgatga 
gtagggcgac tgattcgcga 



agacggtcag ctggcgaaag 
ggcggtagcc gtcacccagg 
cggtacgggc aaaacctacg 
tatctcgacc ggctcaaaag 
ctcaaaggca ttgaaatata 
ctgcctcgaa cgtctcgaac 
aagcgatgtg atcctgctgc 
ctgcgtcagc gtggcggaag 
ctgtcttggc agcgactgcc 
agcgatggac gccgatgtgg 
taaagagagt ggatttggcg 
ccaccagcta ccggacattg 
gctcgacctg gcaaaagaca 
gttacaaaag tgcgctgatc 
tgagccaggt tatcgcggta, 
atttttactg ctcgatgaca 
gcgttccgcc ttgctggatg 
gcggctaaaa gagatcaatc 
ttttactctg gctctcacgc 
aaaacccggt agctggatct 
tttcacctcg cggcttggca 
ttacagccgc caggcgttac 
ttctgctcgc cagttagcgg 
ttttatgctt tgtacctcgc 
catgacgctt cctgtattgt 
tgtcagcgcc ggtaatgcgc 
gcgtggcgat acattgtcat 
tccactgtta aaagcgcgca 
agtgcaacta ccagatgccg 
cgccgacgat cgtggcgtgc 



cgataccagg ctttaagccg 60 
cgatagaaaa aggccagccg 120 
cttacctggc tcctgcgctg 180 
cgttgcagga tcagctctac 240 
cgggcaacgt ggcgctgctg 300 
agcaggcgct ggcggggggc 360 
gctcctggtc taatcaaaca 420 
attcacaggc gtggccgctg 480 
cgatgtataa agattgcttt 54 0 
tggtggtaaa ccatcatctc 600 
aactgatccc ggaagcggac 660 
ccagccagta ttttggtcag 720 
tcaccatcgc ctaccgcacc 780 
gtcttgccca gagtgcgcag 8 40 
acctgcgtga gctgttagct 900 
ccctggaact ttgttatgac 960 
cggcatttga gcgcgccacg 1020 
agccgggcta cagctactgg 1080 
cgctcagcgt ggcggataaa 1140 
tcacctcagc aacgctgtcg 1200 
tcgaacaggc cgagtcgttg 1260 
tctgtgtgct gcgcaatctg 1320 
caatgctgcg accgatcatc 1380 
acgccatgat gcgcgatctg 1440- 
tgcaggggga aaccagcaaa 1500 
ttcttgtggc aaccagcagt 1560 
tggtaattat cgacaaattg 1620 
tggaagattg tcgtttgcgc 1680 
tcattactct caaacagggg 1740 
tggtgatttg tgacaatcgg 1800 
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ctggtgatgc gtccttacgg cgcgacgttt ctcgccagtc tgccgcccgc gccacgcacc 1860 
cgtgacattg cccgtgcggt tcgtttcctt gcgataccat cctccaggta a 1911 



<210> 34 
<211> 414 
<212> DNA 

<213> Escherichia coli 



<400> 34 

atggctaata 

gtgacgcaga 

gacggcgtat 

gattccggct 

atcaaagact 

catctggggc 

tctgcctctt 



aaccttcggc 
atcatgggac 
atcactgttt 
gtggctggcc 
tgtcacatgg 
atgtcttccc 
tacgctttac 



agaagaactg 
agaaccgcca 
gatctgcgat 
cagtttctac 
aatgcagcgc 
cgacgggccg 
cgatggcgaa 



aaaaaaaatt 
tttacgggtc 
gccccgctgt 
gaaccggtaa 
atagaaattc 
cacccaacgg 
aacggcgaag 



tgtccgagat 
gtttactgca 
ttcattccca 
gtgaagaatc 
gttgcggtaa 
gcgaacgtta 
aaatcaacgg 



gcagttttac 60 
taacaagcgt 120 
aaccaagtat 180 
cattcgttat 240 
ctgtgatgcc 300 
ttgtgttaac 360 
ttga 414 



<210> 35 
<211> 1968 
<212> DNA 

<213> Escherichia coli 



<400> 35 

atgaccattg 

gcggcggggc 

ggcaacctgt 

ggcaaagagg 

atcgcgctgg 

aaggagatca 

gggcgctcgc 

atcgtgtttc 

acctgcgata 

actattctgg 

gtgcagacca 

ctgggctgtc 

acctcgcagg 

tccgggcagg 

gatagccgcg 

gtgatccacg 

cacgcggcgg 

ccgcgtctgg 

ctcgcgggcg 

gacgccatga 

gagcgccggg 

gtgatcctgc 

acgggcaaca 

gtgggcgaag 

caggcgatca 

ggcggcgggc 

catatctcgt 

acgggcgcct 

ctgcgcgata 

aacttcatcg 

cggcggcaga 

tgggcggcac 

aaaattatcg 



agaaaatttt 
cgcagggcgc 
tcggcatgac 
tgctgattat 
gctaccacac 
cccgcaatgg 
agggcacgca 
gccgcctgat 
aagggctgcc 
tgccgggcgg 
tcggcgcgcg 
gcgcctgcgc 
tggtcgcgga 
cggtgtggct 
gcatcaccac 
cggcgttcgg 
gctgcacgat 
tgagcgtgct 
gcgtgccgga 
ccgtgaccgg 
cgcgcttccg 
cgccggagaa 
tcgctccgga 
atggcgtata 
aggcgatcaa 
cgtccggcac 
ggggcaagac 
gcttcggcca 
acgacatcat 
gcaccgcgga 
cgcacccgga 
tgcagtcggt 
aggtaattaa 



caccccgcag 
tctgccgctg 
gcagaacgcc 
cggcactcag 
cgggcattgg 
cgggatcccg 
cggtatgttc 
ccgctccctg 
cgccaccatg 
ggcgacgctg 
tttcgccaac 
ctcgccgggc 
ggcgctgggt 
ggagatcgcc 
gcgggatatc 
cggctccacc 
cccggacgtt 
gcccaacggc 
ggtgatgctc 
ccagacggtg 
ccagtgcctg 
ggcaaaagcg 
aggttcggtg 
ccaccacacc 
gcgggaagag 
cggcatggaa 
ggtgtcgctc 
cgtgtcgccg 
cgagattgcc 
caacccgctg 
cctgcacgcc 
gagcggcggc 
cgccggtaaa 



gacgacgcgt 
accccgcaga 
gggatgggct 
ggcggcatcc 
gagatcggca 
ttcgcggcct 
gattccctgc 
ccgacgcggc 
attgcgctgg 
ccgccgaccg 
cacgaactct 
ggcgggtgtc 
ctggcgctgc 
cgccagtcgg 
ctctccgata 
aatttactgc 
gagcactgga 
ccggactatc 
cacctgcgcg 
ggcgagaacc 
cgcgagcagg 
aaagggctga 
atcaaggcca 
ggccgggtgc 
attgtgcagg 
gagacctacc 
atcaccgatg 
gaggcgctgg 
gtggatcgtc 
acgccggaag 
cacgactttt 
acctggaaag 
aaagcgctcg 



tttatgcggt 
tgctgatgga 
gggacgccaa 
gcgccggaga 
tgcagatgca 
tcgtcagcga 
cgtaccgcaa 
gggcggtgat 
ccgcgatgca 
tcggggaaga 
ccctgcagga 
.agttcctcgg 
cgcactccgc 
cgcgcgcggt 
aagccatcga 
tgcacattcc 
cgcgcatcaa 
acccgaccgt 
acctcggcct 
ttgaatggtg 
acggcgtaga 
cctcgacggt 
cggcgatcga 
gggtgtttgt 
gcgatatcat 
agctcacctc 
cgcgcttctc 
cgggcgggcc 
tgacgttaac 
agggcgcgcg 
tgccggacga 
gctgtattta 
gaatttaa 



gatcacccac 
atctcccagc 
caagctcacc 
cggacgccca 
ggcggcggcg 
tccgtgcgac 
cgacgcggcg 
cggcgtagcg 
cgacctgccg 
cgcgggcaag 
ggccgccgaa 
cacggcgggc 
gctggcgccg 
cagcgagctg 
aaacgcgatg 
ggccatcgcc 
ccgtaaagtg 
gcgcgccttc 
gctgcatctg 
gcaggcgtcc 
gccggatgac 
ctgcttcccg 
cccgtcggtg 
ctcggaagcg 
ggtggtgatc 
cgcgctaaag 
gggcgtgtcg 
gattggcaag 
tggcagcgtg 
cgagctggcg 
cacccggctg 
tgacaccgat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1968 
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<21C> 36 
<211> 717 
<212> DNA 

<213> Escherichia coii 



<400> 36 

gtgggacgta 

aaaatttatg 

ccagaattaa 

aagcacgtta 

cagggacgtt 

tcaaatgtta 

atcggagcgg 

gggacagacc 

gtgactgaag 

atcgcggctc 

gctcaatctg 

gcccttgaag 



aatgggccaa 
caaaattcgg 
acacatcttt 
ttgataaagc 
atgaaggctt 
accgtacgat 
caggttctgt 
ctgaccatat 
aagaaggtaa 
taaaagcagc 
aagttgagct 
atgacgacga 



tattgttgct 
tgtagaaatc 
aaaattcgtt 
aattgataaa 
tggtcctaat 
tgctaacgtt 
cagctatatg 
ttttgaaatt 
cattgttatt 
tggaatcact 
ttccccagaa 
tgtacaaaaa 



aaaaaaacgg 
tatgctgctg 
attgaacgtg 
gccaaaggcg 
ggctcaatga 
cgcacaattt 
tttgacaata 
ttacttgaag 
tatactgaac 
gagttctcaa 
gatttagaaa 
gtttatcata 



ctaaagacgg 
ctaaacaagg 
caaagcaggc 
gcggagatga 
tratcgccga 
tcaataaaaa 
cgggtgtgat 
ctgaagttga 
ctactgacct 
caacagaatt 
tctttgaagg 
acgtcgcaaa 



tgcaacgtct 60 
tgaacccgat 120 
acaagttcca 180 
aacgttcgtg 240 
gacattgact 300 
aggcggcaat 360 
tgtatttaaa 420 
tgttcgtgat 480 
tcataaagga 540 
agaaatgatt 600 
gcttgttgat 660 
tctctaa 717 



<210> 37 
<211> 258 
<212> DNA 

<213> Escherichia coli 
<400> 37 

atggcgccgc cactgtcgcc tggctcgcgg 
cgcctgatta gtccgctact cgggccgcat 
ggaattgagg cattgcgcag gtttggagtg 
gtattaaaat gccacccttt acaccctggt 
gataccagag aacactaa 



gtcctgatag ccctcattcg ggtctatcaa 60 
tgtcgtttca ctccaacctg ttcaagctac 120 
ataaaaggca gttggttgac ggtgaaacgc 180 
ggtgacgatc ccgtcccgcc cggaccattt 240 

258 



<210> 38 
<211> 1023 
<212> DNA 

<213> Escherichia coli 



<400> 38 

atgaaaaaag 

gtgggcgtct 

atatttaccc 

aagatcatca 

cactttaaag 

ttgctggaaa 

agcgattacc 

aagtacgcca 

ttctggccag 

gcgcacaaga 

ccttataaag 

gttgccagtg 

cgcctgcaga 

tctcgtgcag 

ccaggtgcga 

acgccgtatc 

gccagtcata 

taa 



tgttattgat 
ggaaggttcg 
tgaagccagg 
atcgtccacg 
ccgggactta 
gcggtaaaga 
tcaagcaatt 
ccgtagcgca 
acacctggat 
aaatggtgaa 
ataaaaacca 
aacgcgataa 
ccgacccgac 
acctggaaac 
tagcgacgcc 
tctattttgt 
acaagtctgt 



aatcttgtta 
ccatcttgcc 
gaccggacgt 
ggtttttcaa 
ccgctttaca 
agcacagttc 
gcgtgaggcc 
ggcacttgaa 
gtataccgcc 
agcggtcgat 
gttggtgacg 
ggttgcctca 
cgtgatttac 
gccgacagcg 
gggggcggat 
ggccgatggt 
gcaggattat 



ttgctggtgg 
gacagcaaat 
ctggcgctcg 
tggctgctgc 
ccgcagatga 
cctctgcgac 
ccgtatatca 
ctggaaaacc 
aataccaccg 
agcgcctggg 
atggcatcaa 
gtatttatca 
gggatgggag 
tataacacct 
tcgctgaagg 
aaaggtggtc 
ctgaaagtgc 



tactgggtat 
tgcttatcaa 
gtgaacagct 
gtatcgaacc 
ccgtgcgcga 
tggtagaagg 
agcatacgct 
cggagtggat 
atgtcgcgtt 
aagggcgtgc 
ttatcgaaaa 
accgtttacg 
agcgttataa 
ataccattac 
ctgctgcgca 
acacgtttaa 
ttaaggaaaa 



cgccgctggt 
agaagagacg 
ttatgccgat 
ggatctttct 
gatgctgaaa 
gatgcgtctg 
gagcgatgat 
tgaaggttgg 
actcaagcga 
ggacggtctg 
agaaaccgcc 
cattggtatg 
tggcaaactt 
cggtctgccg 
tccggcaaaa 
taccaatctt 
aaatgcgcag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1023 
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<210> 39 
<211> 873 
<212> DNA 

<213> Escherichia coli 
<400> 39 

atgctgcccg actcatcagt ccgtttaaat 
cgccgcgaag cggatcgcta tatcgagcaa 
accattggcg atcaggtgaa acccggcgac 
cctcgggaag ccgaagattt ggtacttatc 
accaccgaag atggcgagcg cgataacatt 
ttcccgattg gccgcctgga taaagactcc 
gatctggtga ataagatcct gcgtgctggc 
gtcgataaac cgattaccga ggagtttatt 
gggacagtga ccaaaaagtg caaagttaaa 
ctggtgcagg ggctgaaccg tcagatccgg 
aaaaagctgg aacgcacgcg catcatgaac 
tggcgcgatt taaccgacga tgagttaatc 
tccgaggtaa aacctaaagc gaaggccaaa 
gttaagatgg aaaaaacggc ggaaaaaggc 
acctcgccgg ggcgtaaaaa gaaggggcgc 



13 



aaatacatca gcgaaagcgg aatttgctca 60 
ggcaatgtgt tccttaatgg caagcgagcc 120 
gttgtgaaag taaacggtca gttgattgaa 180 
gccctgaaca agcccgttgg tattgtaagc 240 
gtcgatttcg ttaaccacag caaacgcgtg 300 
caggggctga ttttcctcac caatcacggc 360 
aatgatcatg agaaagagta tctggtgacg 420 
cgcggcatga gtgcgggggt gccaatcctc 480 
aaagaagcgc cgtttgtctt ccgcattacc 540 
cgcatgtgcg agcatttcgg ctatgaagtg 600 
gttagcttaa gcggcattcc gctgggggaa 660 
gacctcttta agctcattga aaattcctct 720 
ccgaaaacag cgggcatcaa acgtccagtc 780 
ggtcgcccgg cgrccaacgg taagcgtttt 840 
tga 873 



<210> 40 
<211> 159 
<212> PRT 

<213> Escherichia coli 
<400> 40 

Met Arg He Gly His Gly Phe Asp Val His Ala Phe Gly Gly Glu Gly 
1 5 10 15 

Pro He lie He Gly Gly Val Arg He Pro Tyr Glu Lys Gly Leu Leu 
20 25 30 

Ala His Ser Asp Gly Asp Val Ala Leu His Ala ..Leu Thr Asp Ala Leu 
35 40 45 

Leu Gly Ala Ala Ala Leu Gly Asp He Gly Lys Leu Phe Pro Asp Thr 
50 55 60 

Asp Pro Ala Phe Lys Gly Ala Asp Ser Arg Glu Leu Leu Arg Glu Ala 
65 70 75 80 

Trp Arg Arg He Gin Ala Lys Gly Tyr Thr Leu Gly Asn Val Asp Val 
85 90 95 

Thr He He Ala Gin Ala Pro Lys Met Leu Pro His He Pro Gin Met 
100 105 110 

Arg Val Phe He Ala Glu Asp Leu Gly Cys His Met Asp Asp Val Asn 
115 120 125 

Val Lys Ala Thr Thr Thr Glu Lys Leu Gly Phe Thr Gly Arg Gly Glu 
130 135 140 



Gly He Ala Cys Glu Ala Val Ala Leu Leu He Lys Ala Thr Lys 
145 150 155 



WO 00/61793 



14 



PCT/EP00/03135 



<210> 41 

<211> 158 

<212> PRT 

<213> Haemophilus influenzae 

<400> 41 

Met lie Arg He Gly His Gly Phe Asp Val His Ala Phe Gly Glu Asp 
1- 5 10 15 

Arg Pro Leu He He Gly Gly Val Glu Val Pro Tyr His Thr Gly Phe 
20 25 30 

He Ala His Ser Asp Gly Asp Val Ala Leu His Ala Leu Thr Asd Ala 
35 40 45 

He Leu Gly Ala Ala Ala Leu Gly Asp He Gly Lys Leu Phe Pro Asp 
50 55 60 

Thr Asp Met Gin Tyr Lys Asn Ala Asp Ser Arg Gly Leu Leu Arg Glu 
65 70 75 80 

Ala Phe Arg Gin Val Gin Glu Lys Gly Tyr Lys He Gly Asn Val Asp 
85 90 95 

He Thr He He Ala Gin Ala Pro Lys Met Arg Pro His lie Asp Ala 
100 105 110 

Met Arg Ala Lys He Ala Glu Asp Leu Gin Cys Asp He Glu Gin Val 
115 120 125 

Asn Val Lys Ala Thr Thr Thr Glu Lys Leu Gly Phe Thr Gly Arg Gin 
130 135 140 

Glu Gly He Ala Cys Glu Ala Val Ala Leu Leu He Arg Gin 
145 150 155 



<210> 42 
<211> 158 
<212> PRT 
<213> Bacillus , 

<400> 42 
Met Phe Arg He 
1 

Arg Pro Leu He 
20 

Leu Gly His Ser 
35 

Cys Leu Gly Ala 
50 

Thr Asp Pro Glu 
65 



;ubtilis 

Gly Gin Gly Phe 
5 

He Gly Gly He 

Asp Ala Asp Val 
40 

Val Gly Glu Gly 
55 

Phe Lys Asp Ala 
70 



Asp Val His Gin 
10 

Glu He Pro Tyr 
25 

Leu Leu His Thr 



Asp He Gly Lys 
60 

Asp Ser Phe Lys 
75 



Leu Val Glu Gly 
15 

Glu Lys Gly Leu 
30 

Val Ala Asp Ala 
45 

His Phe Pro Asp 



Leu Leu Gin His 
80 
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Val Trp Gly He Val 
85 

Cys Thr He He Ala 
100 

Met Arg Lys Arg He 
115 

Asn Val Lys Ala Thr 
130 

Glu Gly lie Ala Ala 
145 



Lys Gin Lys Gly Tyr Val 
90 

Gin Lys Pro Lys Met Leu 
105 

Ala Glu Gly Leu Glu Ala 
120 

Thr Thr Glu Lys Leu Gly 
135 

Gin Ala Thr Val Leu He 
150 155 



Leu Gly Asn lie- Asp 
95 

Pro Tyr He Glu Asp 
110 

Aso Val Ser Gin Val 
125 

Phe Thr Gly Arg Ala 
140 

Gin Lys Gly 



<210> 43 
<211> 161 
<212> PRT 

<213> Synechocystis sp . 
<400> 43 

Met Thr Ala Leu Arg He Gly Asn Gly Tyr Asp lie His Arg Leu Val 
15 10 15 

Gly Asp Arg Pro Leu lie Leu Gly Gly Val Thr lie Ala His His Leu 
20 25 30 

Gly Leu Asp Gly His Ser Asp Ala Asp Val Leu Thr His Ala Leu Met 
35 40 45 

Asp Ala Leu Leu Gly Ala Leu Ser Leu Gly Asp lie Gly His Tyr Phe 
50 55 60 

Pro Pro Ser Asp Ala Arg Trp Gin Gly Ala Asp Ser Leu Lys Leu Leu 
65 70 75 80 

Ala Gin Val His Gin Leu He Leu Glu Arg Gly Trp Arg lie Asn Asn 
85 90 95 

Leu Asp Asn Val lie Val Ala Glu Gin Pro Lys Leu Lys Pro His lie 
100 105 110 

Gin Ala Met Lys Glu Asn Leu Ala Lys Val Leu Thr lie Asp Pro Asp 
115 120 125 

Leu lie Gly lie Lys Ala Thr Thr Asn Glu Arg Leu Gly Pro Thr Gly 
130 135 140 

Arg Glu Glu Gly lie Ala Ala Tyr Ser Val Ala Leu Leu lie Lys Glu 
145 150 155 160 

Gly 



<210> 44 
<21i> 399 
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<212> PRT 

<213> Treponema pallidum 
<400> 44 

Met Arg Arg Gly Gly Ala Cys Val Gin Lys Lys Glu Tyr Leu Pro Leu 
15 10 15 

Thr Ser Arg Gin Pro Gly Val Cys Leu Leu Ser Glu He Leu Val Arg 
20 25 30 

Ala Leu Glu Ala Arg Ser Phe Phe Leu Val Val Val Thr Val Pro Ala 
35 40 45 

Gly Glu Val Ala Tyr Ala Glu Ser Gin Val Ala Cys Asp Ser Arg Leu 
50 55 60 

Ser Ala Phe Pro Ser Arg Thr Arg Pro Val He Leu Tyr Val Pro Gly 
65 70 75 80 

Ala His Thr Arg Ser Ala Ser Val Arg Ala Gly Leu Asp Ala Met Ala 
85 90 95 

Thr His Ala Pro Asp Val Val Leu Val His Asp Gly Ala Arg Pro Phe 
100 105 110 

Val Ser Val Ala Leu He His Ser Val Leu Glu Ala Thr Cys Arg Tyr 
115 120 125 

Gly Ala Ala Val Pro Val He Glu Ala Thr Asp Thr Pro Lys Gly Val 
130 135 140 

Ala Ala Asp Gly Ser He Glu Thr His Leu He Arg Ser Arg Val Arg 
145 150 155 160 

Leu Ala Gin Thr Pro Gin Gly Phe Cys Tyr Ala Ser Leu Cys Ala Ala 
165 170 175 

His His Arg Ala Ala Thr Asp Gly Glu Gin Tyr Thr Asp Asp Ser Glu 
180 185 190 

Leu Tyr Ala Arg Tyr Gly Gly Thr Val His Val Cys Ala Gly Glu Arg 
195 200 205 

Ser Asn Val Lys He Thr Tyr Pro Glu Asp Leu Glu Gin Arg Ala Ser 
210 215 220 

Glu Pro Ala Leu Thr Arg Gly He Ser Val Leu Pro Cys Thr Glu Glu 
225 230 235 240 

Gly Ala Leu Arg Val Gly Leu Gly Thr Asp Met His Ala Leu Cys Ala 
245 250 255 

Gly Arg Pro Leu He Leu Ala Gly He His He Pro Ser Lys Lys Gly 
260 265 270 

Ala Gin Gly His Ser Asp Ala Asp Val Leu Ala His Ala Ser He Asp 
275 280 285 

Ala Leu Leu Gly Ala Ala Gly Leu Gly Asp lie Gly Thr Phe Phe Pro 
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290 295 300 

Ser Cys Aso Gly Arg Trp Lys Asp Ala His Ser Cys Ala Leu Leu Arg 
305 " 310 315 320 

His Thr Trp Gin Leu Val Arg Ala Ala Cys Trp Arg Leu Val Asn Leu 
325 ' 330 335 

Asp Ala Val Val Cys Leu Glu Gin Pro Ala Leu His Pro Phe Arg Glu 
340 345 350 

Ala Met Arg Ala Ser Leu Ala Gin Ala Leu Asp Thr His Val Thr Arg 
355 360 365 

Val Phe Val Lys Ala Lys Thr Ala Glu Arg Leu Gly Pro Val Gly Ser 
370 375 380 

Gly Ala Ala Val Thr Ala Gin Val Val Val Leu Leu Lys Lys lie 
385 390 395 



<210> 45 
<211> 406 
<212> PRT 

<213> Helicobacter pylori 
<400> 45 

Met Ser Leu He Arg Val Asn Gly Glu Ala Phe Lys Leu Ser Leu Glu 
15 10 15 

Ser Leu Glu Glu Asp Pro Phe Glu Thr Lys Glu Thr Leu Glu Thr Leu 
20 25 30 

He Lys Gin Thr Ser Val Val Leu Leu Ala Ala Gly Glu Ser Arg Arg 
35 40 45 

Phe Ser Gin Thr He Lys Lys Gin Trp Leu Arg Ser Asn His Thr Pro 
50 55 60 

Leu Trp Leu Ser Val Tyr Glu Ser Phe Lys Glu Ala Leu Asp Phe Lys 
65 70 75 80 

Glu He He Leu Val Val Ser Glu Leu Asp Tyr He Tyr He Lys Arg 
85 90 95 

His Tyr Pro Glu He Lys Leu Val Lys Gly Gly Ala Ser Arg Gin Glu 
100 105 110 

Ser Val Arg Asn Ala Leu Lys He He Asp Ser Ala Tyr Thr Leu Thr 
115 120 125 

Ser Asd Val Ala Arg Gly Leu Ala Asn He Glu Ala Leu Lys Asn Leu 
130 135 140 

Phe Leu Thr Leu Gin Gin Thr Ser His Tyr Cys He Ala Pro Tyr Leu 
145 150 155 160 

Pro Cys Tyr Asp Thr Ala He Tyr Tyr Asn Glu Ala Leu Asp Arg Glu 
165 170 175 
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Ala lie Lys Leu He Gin Thr Pro Gin Leu Ser His Thr Lys Ala Leu 
180 185 190 

Gin Ser Ala Leu Asn Gin Gly Asp Phe Lys Asp Glu Ser Ser Ala He 
195 200 205 

Leu Gin Ala Phe Pro Asp Arg Val Ser Tyr He Glu Gly Ser Lys Asp 
210 215 220 

Leu His Lys Leu Thr Thr Ser Gly Asp Leu Lys His Phe Thr Leu Phe 
225 230 235 240 

Phe Asn Pro Ala Lys Asp Thr Phe He Gly Met Gly Phe Asp Thr His 
245 250 255 

Ala Phe He Lys Asp Lys Pro Met Val Leu Gly Gly Val Val Leu Asp 
260 * 265 270 

Cys Glu Phe Gly Leu Lys Ala His Ser Asp Gly Asp Ala Leu Leu His 
275 280 285 

Ala Val He Asp Ala He Leu Gly Ala He Lys Gly Gly Asp He Gly 
290 * 295 300 

Glu Trp Phe Pro Asp Asn Asp Pro Lys Tyr Lys Asn Ala Ser Ser Lys 
305 310 315 320 

Glu Leu Leu Lys He Val Leu Asp Phe Ser Gin Ser lie Gly Phe Glu 
325 330 335 

Leu Phe Glu Met Gly Ala Thr He Phe Ser Glu lie Pro Lys lie Thr 
340 345 350 

Pro Tyr Lys Pro Ala He Leu Glu Asn Leu Ser Gin Leu Leu Gly Leu 
355 360 365 

Glu Lys Ser Gin He Ser Leu Lys Ala Thr Thr Met Glu Lys Met Gly 
370 375 380 



Phe He Gly Lys Gin Glu Gly Leu Leu Val Gin Ala His Val Ser Met 
385 390 395 400 



Arg Tyr Lys Gin Lys Leu 
405 



